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VACCINES 

Technical Field of the Invention 

5 The present invention relates generally to therapy of cancer, such as colon, 

colorectal, breast, bladder, lung and endometrial cancer. The invention is more 
specifically related to polypeptides, comprising at least a portion of a cripto protein 
tumor protein, and to polynucleotides encoding such polypeptides, in particular 
pharmaceutical compositions, e.g., vaccines, and other compositions for the treatment 
10 of cancer that are cripto - expressing carcinomas such as certain non-small long cell 
carcinoma, breast, colon, colorectal cancer. 

Background of the Invention 

CRIPTO is a 188 amino acid protein shares homologies with the epidermal 
15 growth factor (EGF) family (EMBO Journal (1989) Vol 8 (7) ppl987-1991). 

huCRIPTO mRNA is detected only in undifferentiated cells and disappear after 
cell differentiation mCRIPTO is expressed during pregnancy and lactation (induces 
branching morphogenesis in mammary epithelial cells) and is suspected to be an 
autocrine growth factor for normal breast cells. CRIPTO is required for correct 
20 orientation of the anterior-posterior axis in the mouse embryo. 

Human cripto gene has been expressed US 5,654,140 herein incorporated by 
reference in its entirety. Inmiunogentic fragments of CRIPTO are described in WO 
02/16413 and U.S. Patent Application Nos. 10/362597 and 10/407481 herein 
incorporated by reference in their entirety. 

25 Cancer is a significant health problem throughout the world. Although 

advances have been made in detection and therapy of cancer, no vaccine or other 
universally successful method for prevention and/or treatment is currently available. 
Current therapies, which are generally based on a combination of chemotherapy or 
surgery and radiation, continue to prove inadequate in many patients. 

30 Colon cancer is the second most frequently diagnosed malignancy in the United 

Sates as well as the second most common cause of cancer death. An estimated 95,600 
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new cases of colon cancer will have been diagnosed in 1998, with an estimated 47,700 
deaths. The five-year survival rate for patients with colorectal cancer detected in an 
early-localised stage is 92%, unfortunately, only 37% of colorectal cancer is diagnosed 
at this stage, the survival stage. The survival rate drops to 64% if the cancer is allowed 
5 to spread to adjacent organs or lymph nodes, and to 7% in patients with distant 
metastases. 

The prognosis of colon cancer is directly related to the degree of penetration of 
the tumour through the bowel wall, to the level of metastasis, to the presence or 
absence of nodal involvement, consequently, early detection and treatment are 

10 especially important. Currently, diagnosis is aided by the use of screening assays for 
fecal occult blood, sigmoidoscopy, colonoscopy and double contrast barium enemas. 
Treatment regimens are determined by the type and stage of the cancer, and include 
surgery, radiation therapy and/or chemotherapy. Recurrence following surgery (the 
most common form of therapy) is a major problem and is often the ultimate cause of 

15 death. In spite of considerable research into therapies for the disease, colon cancer 
remains difficult to diagnose and treat. Accordingly, there is a need in the art for 
improved methods for treating such cancers. The present invention fulfils these needs 
and further provides other related advantages. 

Breast cancer is a significant health problem for women in the United States and 
20 throughout the world. Although advances have been made in detection and treatment 
of the disease, breast cancer remains the second leading cause of cancer-related deaths 
in women, affecting more than 180,000 women in the United States each year. For 
women in North America, the lifetime odds of getting breast cancer are now one in 
eight. 

25 No vaccine or other universally successful method for the prevention or 

treatment of breast cancer is currently available. Management of the disease currently 
relies on a combination of early diagnosis (through routine breast screening procedures) 
and aggressive treatment which may including one or more of a variety of treatments 
such as surgery, radiotherapy, chemotherapy and hormone therapy. The course of 

30 treatment for a particular breast cancer is often selected based on a variety of prognostic 
parameters, including an analysis of specific tumour markers. See, eg Porter- Jordan 
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and Lippman, Breast Cancer 8:73-100 (1994). However, the use of established markers 
often leads to a result that this is difficult to interpret, and the high mortality observed 
in breast cancer patients indicates that improvements are needed in the treatment and 
prevention of the disease. 

5 Lung cancer is the primary cause of cancer death among both men and women 

in the US, with an estimated 172,000 new cases being reported in 1994. The five-year 
survival rate among all lung cancer patients, regardless of the state of disease at 
diagnosis is only 13%. This contrasts with a five-year survival rate of 46% among 
cases detected while the disease is still localised. However, only 16% of lung cancers 
10 are discovered before the disease has spread. 

In spite of considerable research into therapies for these and other cancers, 
breast, colon and colorectal remains difficult to diagnose and treat effectively. 
Accordingly, there is a need in the art for improved methods for treating and preventing 
such cancers. The present invention fulfills these needs and further provides other 
1 5 related advantages . 

Summary of the Invention 

In one aspect of the present invention, an immunogentic fragment of a Cripto 
polypeptide is provided wherein the immunogenic fragment is immunologically 
reactive with an antibody and/or T-cell that reacts with a polypeptide of SEQ IE) NO: 3 
20 or SEQ ID NO:4, wherein said immunogenic fragment is at least 20 contiguous amino 
acids in length, comprises SEQ ID NO:97 and does not contain SEQ ID NO: 1 1 and 
SEQ ID NO: 12. 

In another aspect of the present invention, a method is provided for stimulating T 
cells specific for Cripto, comprising contacting said cells with an immunogenic 
25 fragment of a Cripto polypeptide wherein the inmiunogenic fragment is 
immunologically reactive with an antibody and/or T-cell that reacts with a full-length 
polypeptide of SEQ ID NO:3 or SEQ ID NO:4, wherein said immunogenic fragment is 
at least 20 contiguous amino acids in length, comprises SEQ ID NO:97 and does not 
contain SEQ ID NO: 1 1 or SEQ ID NO: 12. 
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In another embodiment of the present invention, a method is provided for inhibiting 
the development of a cancer in a patient, comprising the steps of: 

(a) incubating CD4+ and/or CD8+ T cells isolated from a patient with SEQ 
ID NO:97; and 

5 (b) administering to the patient an effective amount of the T cells, and 

thereby inhibiting the development of a cancer in the patient. 

In yet another embodiment, a method for producing an inmiunogenic response to 
Cripto in an animal comprising administering a first component comprising a 
polynucleotide encoding SEQ ID NO:97 and does not encode SEQ ID NO: 3 or SEQ ID 
10 NO:4 to the animal. It will be understood by the skilled artisein that as a result of the 
degeneracy of the genetic code, there are many nucleotide sequences that encode a 
polypeptide as described herein. 

In yet another embodiment, a method is provided for inducing an immunoresponse 
to Cripto in an animed comprising repeatedly administering a composition comprising a 
15 first component comprising SEQ ID NO: 97. 

The polypeptides and/or polynucleotides of the present invention are 
inununogenic, i.e., they are capable of eliciting an immune response, particularly a 
humoral and/or cellular immune response, when if necessary, they are conjugated to a 
suitable carrier and/or adjuvanted. 

20 The present invention further provides fragments, variants and/or derivatives of 

the disclosed polypeptide and/or polynucleotide sequences, wherein the fragments, 
variants and/or derivatives preferably have a level of immunogenic activity of at least 
about 50%, preferably at least about 70% and more preferably at least about 90% of the 
level of immunogenic activity of a polypeptide sequence set forth in SEQ ID No: 2 or 4 

25 or a polypeptide sequence encoded by a polynucleotide sequence set forth in SEQ ID 
NO: 1 or 3. 

The present invention further provides polynucleotides that encode a 
polypeptide described above,, expression vectors comprising such polynucleotides and 
host cells transformed or transfected with such expression vectors. 
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Within other aspects, the present invention provides pharmaceutical 
compositions comprising a polypeptide or polynucleotide as described above and a 
physiologically acceptable carrier. 

Within a related aspect of the present invention, the pharmaceutical 
5 compositions, e.g., vaccine compositions, are provided for prophylactic or therapeutic 
applications. Such compositions generally comprise an immunogenic polypeptide or 
polynucleotide of the invention and an inmiunostimulant, such as an adjuvant. 

Within further aspects, the present invention provides pharmaceutical 
compositions comprising: (a) an antigen presenting cell that expresses a pol5^eptide as 
10 described above and (b) a pharmaceutically acceptable carrier or excipient. Illustrative 
antigen presenting cells include dendritic cells, macrophages, monocytes, fibroblasts 
and B cells. 

Within related aspects, pharmaceutical compositions are provided that 
comprise: (a) an antigen presenting cell that expresses a polypeptide as described 
15 above and (b) an immunostimulant. 

The present invention further provides, in other aspects, fusion proteins that 
comprise at least one polypeptide as described above, as well as polynucleotides 
encoding such fusion proteins, typically in the form of pharmaceutical compositions, 
e.g., vaccine compositions, comprising a physiologically acceptable carrier and/or an 
20 immunostimulant. The fusion proteins may comprise multiple immunogenic 
polypeptides or portions/variants thereof, as described herein, and may further comprise 
one or more polypeptide segments for facilitating the expression, purification and/or 
immunogenicity of the polypeptide(s). 

Within further aspects, the present invention provides methods for stimulating 
25 an immune response in a patient, preferably a T cell response in a human patient, 
comprising administering a pharmaceutical composition described herein. The patient 
may be afflicted with lung or colon cancer or colorectal cancer or breast cancer, in 
which case the methods provide treatment for the disease, or patient considered at risk 
for such a disease may be treated prophylactically. In particular the patient will be 
30 afflicted with a tumour expressing cripto antigens. 
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Within further aspects, the present invention provides methods for inhibiting the 
development of a cancer in a patient, comprising administering to a patient a 
pharmaceutical composition as recited above. The patient may be afflicted with lung, 
colon, colorectal or breast cancer, in which case the methods provide treatment for the 
5 disease, or patient considered at risk for such a disease may be treated prophylactically. 

The present invention further provides, within other aspects, methods for 
removing tumor cells from a biological sample, comprising contacting a biological 
sample with T cells that specifically react with a polypeptide of the present invention, 
wherein the step of contacting is performed under conditions and for a time sufficient to 
10 permit the removal of cells expressing the protein from the sample. 

Within related aspects, methods are provided for inhibiting the development of 
a cancer in a patient, comprising administering to a patient a biological sample treated 
as described above. 

Methods are further provided, within other aspects, for stimulating and/or 
15 expanding T cells specific for a polypeptide of the present invention, comprising 
contacting T cells with one or more of: (i) a polypeptide as described above; (ii) a 
polynucleotide encoding such a polypeptide; and/or (iii) an antigen presenting cell that 
expresses such a polypeptide; under conditions and for a time sufficient to permit the 
stimulation and/or expansion of T cells. Isolated T cell populations comprising T cells 
20 prepared as described above are also provided. 

Within further aspects, the present invention provides methods for inhibiting the 
development of a cancer in a patient, comprising administering to a patient an effective 
amount of a T cell population as described above. 

The present invention further provides methods for inhibiting the development 
25 of a cancer in a patient, comprising the steps of: (a) incubating CD4+ and/or CD8+ 
T cells isolated from a patient with one or more of: (i) a polypeptide comprising at 
least an immunogenic portion of a cripto polypeptide disclosed herein; (ii) a 
polynucleotide encoding such a polypeptide; and (iii) an antigen-presenting cell that 
expressed such a polypeptide; and (b) administering to the patient an effective amount 
30 of the proliferated T cells, and thereby inhibiting the development of a cancer in the 
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patient. Proliferated cells may, but need not, be cloned prior to administration to the 
patient. 

These and other aspects of the present invention will become apparent upon 
reference to the following detailed description. All references disclosed herein are 
5 hereby incorporated by reference in their entirety as if each was incorporated 
individually. 

Brief Description of the Drawings and Sequence Identifiers 
SEQ ID NO: 1 Cripto 1 Polynucleotide 

SEQ ID NO: 2 Cripto 3 Polynucleotide 

10 SEQ ID NO: 2 Cripto 3 Polypeptide! 

SEQ ID NO: 3 Cripto 1 Polypeptide 

SEQ ID NO: 4 Cripto 3 Polypeptide 

SEQ ID NO: 5 Cripto Polynucleotide as described in US Patent No.5,654,140 
SEQ ID NO: 6 Cripto Polynucleotide as described in US Patent No. 5,654,140 
15 SEQ ID NOS: 7 -10 PCR primers 

SEQ ID NOS: 1 1 & 12 synthetic Cripto 1 peptides 
SEQ ID NOS: 13-94 Epitopes from Cripto 1 and 3 
SEQ ID NO: 95 Cripto 1 variant Polynucleotide 
SEQ ID NO: 96 Cripto 1 variant Polypeptide 

20 Detailed Description of the Invention 

The present invention is directed generally to compositions and their use in the 
therapy and diagnosis of cancer, particularly cripto expressing cancer and metastases 
including Cripto expressing lung, colon, colorectal and breast cancers. As described 
further below, illustrative compositions of the present invention include, but are not 

25 restricted to, polypeptides, particularly inununogenic polypeptides, polynucleotides 
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encoding such polypeptides, antibodies and other binding agents, antigen presenting 
cells (APCs) and immune system cells (e.g., T cells). 

The practice of the present invention will employ, unless indicated specifically 
to the contrary, conventional methods of virology, immunology, microbiology, 
5 molecular biology and recombinant DNA techniques within the skill of the art, many of 
which are described below for the purpose of illustration. Such techniques are 
explained fully in the literature. See, e.g., Sambrook, et al. Molecular Cloning: A 
Laboratory Manual (2nd Edition, 1989); Maniatis et al. Molecular Cloning: A 
Laboratory Manual (1982); DNA Cloning: A Practical Approach, vol. I & II (D. 
10 Glover, ed.); Oligonucleotide Synthesis (N. Gait, ed., 1984); Nucleic Acid 
Hybridization (B. Hames & S. Higgins, eds., 1985); Transcription and Translation (B. 
Hames & S. Higgins, eds., 1984); Animal Cell Culture (R. Freshney, ed., 1986); Perbal, 
A Practical Guide to Molecular Cloning (1984). 

All publications, patents and patent applications cited herein, whether supra or 
15 infra, are hereby incorporated by reference in their entirety. 

As used in this specification and the appended claims, the singular forms "a," 
"an" and "the" include plural references unless the content clearly dictates otherwise. 

Polypeptide Compositions 

"Polypeptide(s)" refers to any peptide or protein comprising two or more amino acids 

20 joined to each other by peptide bonds or modified peptide bonds. "Polypeptide(s)" refers to both 
short chains, commonly referred to as peptides, oligopeptides and oligomers and to longer chains 
generally referred to as proteins. Polypeptides may comprise amino acids other than the 20 gene 
encoded amino acids. "Polypeptide(s)" include those modified either by natural processes, such as 
processing and other post-translational modifications, but also by chemical modification 

25 techniques. Such modifications are well described in basic texts and in more detailed 
monographs, as well as in a voluminous research literature, and they are well known to those of 
skill in the art. It will be appreciated that the same type of modification may be present in the 
same or varying degree at several sites in a given polypeptide. Also, a given polypeptide may 
comprise many types of modifications. Modifications can occur anywhere in a polypeptide, 

30 including the peptide backbone, the amino acid side-chains, and the amino or carboxyl termini. 
Modifications include, for example, acetylation, acylation, ADP-ribosylation, amidation, covalent 
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attachment of flavin, covalent attachment of a heme moiety, covalent attachment of a nucleotide or 
nucleotide derivative, covalent attachment of a lipid or lipid derivative, covalent attachment of 
phosphotidylinositol, cross-linking, cyclization, disulfide bond formation, demethylation, 
formation of covalent cross-links, formation of cysteine, formation of pyroglutamate, formylation, 
5 gamma-carboxylation, GPI anchor formation, hydroxy lation, iodination, methylation, 
myristoylation, oxidation, proteolytic processing, phosphorylation, prenylation, racemization, 
glycosylation, lipid attachment, sulfation, gamma-carboxylation of glutamic acid residues, 
hydroxylation and ADP-ribosylation, selenoylation, sulfation, transfer-RNA mediated addition of 
amino acids to proteins, such as arginylation, and ubiquitination. See, for instance, PROTEINS - 

10 STRUCTURE AND MOLECULAR PROPERTIES, 2nd Ed., T. E. Creighton, W. H. Freeman and 
Company, New York (1993) and Wold, F., Posttranslational Protein Modifications: Perspectives 
and Prospects, pgs^ 1-12 in POSTTRANSLATIONAL COVALENT MODIFICATION OF 
PROTEINS, B. C. Johnson, Ed., Academic Press, New York (1983); Seifter et al., Meth. EnzymoL 
752:626-646 (1990) and Rattan et al.. Protein Synthesis: Posttranslational Modifications and 

15 Aging, Ann. N.Y. Acad. Sci. 663: 48-62 (1992). Polypeptides may be branched or cyclic, with or 
without branching. Cyclic, branched and branched circular polypeptides may result from post- 
translational natural processes and may be made by entirely synthetic methods, as well. 

Particularly illustrative polypeptides of the present invention comprise a 
sequence of at least 10 contiguous amino acids, preferably 20, more preferably 30, 40, 

20 50, 60, 70, 80, 90, 100, 110, 120, 130, 140, 150, 160, 170, 180 amino acids of the cirpto 
protein of ID NO: 2 or 4. 

In certain preferred embodiments, the polypeptides of the invention are 
immunogenic, i.e., they react detectably within an immunoassay (such as an ELISA or 
T-cell stimulation assay) with antisera and/or T-cells from a patient with cripto 

25 expressing cancer. Screening for immunogenic activity can be performed using 
techniques well known to the skilled artisan. For example, such screens can be 
performed using methods such as those described in Harlow and Lane, Antibodies: A 
Laboratory Manual, Cold Spring Harbor Laboratory, 1988. In one illustrative 
example, a polj^eptide may be immobilized on a solid support and contacted with 

30 patient sera to allow binding of antibodies within the sera to the immobilized 
polypeptide. Unbound sera may then be removed and bound antibodies detected using, 
for example, ^^I-labeled Protein A. 
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As would be recognized by the skilled artisan, immunogenic portions of the 
polypeptides disclosed herein are also encompassed by the present invention. An 
"immunogenic portion," as used herein, is a fragment of an immunogenic polypeptide 
of the invention that itself is iiranunologically reactive (i.e., specifically binds) with the 
5 B-cells and/or T-cell surface antigen receptors that recognize the polypeptide. 
Immunogenic portions may generally be identified using well known techniques, such 
as those summarized in Paul, Fundamental Immunology, 3rd ed., 243-247 (Raven 
Press, 1993) and references cited therein. Such techniques include screening 
polypeptides for the ability to react with antigen-specific antibodies, antisera and/or T- 
10 cell lines or clones. As used herein, antisera and antibodies are "antigen-specific" if 
they specifically bind to an antigen (i.e., they react with the protein in an ELISA or 
other inmiunoassay, and do not react detectably with unrelated proteins). Such antisera 
and antibodies may be prepared as described herein, and using well-known techniques. 

In one preferred embodiment, an immunogenic portion of a polypeptide of the 
15 present invention is a portion that reacts with antisera and/or T-cells at a level that is 
not substantially less than the reactivity of the full-length polypeptide (e.g., in an 
ELISA and/or T-cell reactivity assay). Preferably, the level of immunogenic activity of 
the immunogenic portion is at least about 50%, preferably at least about 70% and most 
preferably greater than about 90% of the immunogenicity for the full-length 
20 polypeptide. In some instances, preferred immunogenic portions will be identified that 
have a level of immunogenic activity greater than that of the corresponding full-length 
polypeptide, e.g., having greater than about 100% or 150% or more immunogenic 
activity. 

In certain other embodiments, illustrative immunogenic portions may include 
25 peptides in which an N-terminal leader sequence and/or transmembrane domain have 
been deleted. Other illustrative immunogenic portions will contain a small N- and/or 
C-terminal deletion (e.g., 1-30 amino acids, preferably 5-15 amino acids), relative to 
the mature protein. 

In another embodiment, a polypeptide composition of the invention may also 
30 comprise one or more polypeptides that are immunologically reactive with T cells 
and/or antibodies generated agziinst a polypeptide of the invention, particularly a 
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polypeptide having an amino acid sequence disclosed herein, or to an immunogenic 
fragment or variant thereof. 

In another embodiment of the invention, polypeptides are provided that 
comprise one or more polypeptides that are capable of eliciting T cells and/or 
5 antibodies that are immunologically reactive with one or more polypeptides described 
herein, or one or more polypeptides encoded by contiguous nucleic acid sequences 
contained in the polynucleotide sequences disclosed herein, or immunogenic fragments 
or variants thereof, or to one or more nucleic acid sequences which hybridize to one or 
more of these sequences under conditions of moderate to high stringency. 

10 The present invention, in another aspect, provides polj^eptide fragments ^ 

comprising at least about 5, 10, 15, 20, 25, 50, 100, or 150 contiguous amino acids, or 
more, including all intermediate lengths, of a polypeptide compositions set forth herein, 
such as those set forth in SEQ ID NO: 2 or 4 or those encoded by a polynucleotide 
sequence set forth in a sequence of SEQ ED NO: 1 or 3. It is preferred that the 

15 polypeptides comprise at least one preferably a pluarality of epitopes as set forth in 
sequence ID no 13 to 94. Optionally the frgaments are fused or otherwise conjugated 
to a heterologous carrier. 

In another aspect, the present invention provides variants of the polypeptide 
compositions described herein. Polypeptide variants generally encompassed by the 
20 present invention will typically exhibit at least about 70%, 75%, 80%, 85%, 90%, 91%, 
92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% or more identity (determined as 
described below), along its length, to a polypeptide sequences set forth herein. 

In one preferred embodiment, the polypeptide fragments and variants provide 
by the present invention are inmiunologically reactive with an antibody and/or T-cell 
25 that reacts with a full-length polypeptide specifically set for the herein. 

In another preferred embodiment, the polypeptide fragments and variants 
provided by the present invention exhibit a level of immunogenic activity of at least 
about 50%, preferably at least about 70%, and most preferably at least about 90% or 
more of that exhibited by a full-length polypeptide sequence specifically set forth 
30 herein. 
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A polypeptide "variant/' as the term is used herein, is a polypeptide that 
typically differs from a polypeptide specifically disclosed herein in one or more 
substitutions, deletions, additions and/or insertions. Such variants may be naturally 
occurring or may be synthetically generated, for example, by modifying one or more of 
5 the above polypeptide sequences of the invention and evaluating their immunogenic 
activity as described herein and/or using any of a number of techniques well known in 
the art. 

For example, certain illustrative variants of the polypeptides of the invention 
include those in which one or more portions, such as an N-terminal leader sequence or 
10 transmembrane domain, have been removed. Other illustrative variants include 
variants in which a small portion (e.g., 1-30 amino acids, preferably 5-15 amino acids) 
has been removed from the N- and/or C-terminal of the mature protein. 

In many instances, a v£iriant will contain conservative substitutions. A 
"conservative substitution" is one in which an amino acid is substituted for another 

15 amino acid that has similar properties, such that one skilled in the art of peptide 
chemistry would expect the secondary structure and hydropathic nature of the 
polypeptide to be substantially unchanged. As described above, modifications may be 
made in the structure of the polynucleotides and polypeptides of the present invention 
and still obtain a functional molecule that encodes a variant or derivative polypeptide 

20 with desirable characteristics, e.g., with immunogenic characteristics. When it is 
desired to alter the amino acid sequence of a polypeptide to create an equivalent, or 
even an improved, immunogenic variant or portion of a polypeptide of the invention, 
one skilled in the art will typically change one or more of the codons of the encoding 
DNA sequence according to Table 1. 

25 For example, certain amino acids may be substituted for other amino acids in a 

protein structure without appreciable loss of interactive binding capacity with structures 
such as, for example, antigen-binding regions of antibodies or binding sites on substrate 
molecules. Since it is the interactive capacity and nature of a protein that defines that 
protein's biological functional activity, certain amino acid sequence substitutions can be 

30 made in a protein sequence, and, of course, its underlying DNA coding sequence, and 
nevertheless obtain a protein with like properties. It is thus contemplated that various 
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changes may be made in the peptide sequences of the disclosed compositions, or 
corresponding DNA sequences which encode said peptides without appreciable loss of 
their biological utility or activity. 



Table 1 



Amino Acids 



Codons 



Alanine 


Ala 


A 


GCA 


GCC 


GCG 


GCU 






Cysteine 


Cys 


C 


UGC 


UGU 










Aspartic acid 


Asp 


D 


GAG 


GAU 










Glutamic acid 


Glu 


E 


GAA 


GAG 










Phenylalanine 


Phe 


F 


UUC 


uuu 










Glycine 


Gly 


G 


GGA 


GGC 


GGG 


GGU 






Histidine 


His 


H 


CAC 


CAU 










Isoleucine 


Oe 


I 


AUA 


AUC 


AUU 








Lysine 


Lys 


K 


AAA 


AAG 










Leucine 


T 

Leu 


L 


UUA 


UUG 


CUA 


cue 


CUG 


cuu 


Methionine 


Met 


M 


AUG 












Asparagine 


Asn 


N 


AAC 


AAU 










ProUne 


Pro 


P 


CCA 


CCC 


CCG 


ecu 






Glutamine 


Gin 


Q 


CAA 


CAG 










Arginine 


Arg 


R 


AGA 


AGG 


CGA 


CGC 


CGG 


CGU 


Serine 


Ser 


S 


AGC 


AGU 


UCA 


UCC 


UCG 


ucu 


Threonine 


Thr 


T 


ACA 


ACC 


ACG 


ACU 






Valine 


Val 


V 


GUA 


GUC 


GUG 


GUU 






Tryptophan 


Tip 


W 


UGG 












Tyrosine 


Tyr 


Y 


UAC 


UAU 











In making such changes, the hydropathic index of amino acids may be 

considered. The importance of the hydropathic amino acid index in conferring 

interactive biologic function on a protein is generally understood in the art (Kyte and 
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Doolittle, 1982, incorporated herein by reference). It is accepted that the relative 
hydropathic character of the amino acid contributes to the secondary structure of the 
resultant protein, which in turn defines the interaction of the protein with other 
molecules, for example, enzymes, substrates, receptors, DNA, antibodies, antigens, and 
5 the like. Each amino acid has been assigned a hydropathic index on the basis of its 
hydrophobicity and charge characteristics (Kyte and Doolittle, 1982). These values 
are: isoleucine (+4.5); valine (+4.2); leucine (+3.8); phenylalanine (+2.8); 
cysteine/cystine (+2.5); methionine (+1.9); alanine (+1.8); glycine (-0.4); threonine (- 
0.7); serine (-0.8); tryptophan (-0,9); tyrosine (-1.3); proline (-1.6); histidine (-3.2); 
10 glutamate (-3.5); glutamine (-3.5); aspartate (-3.5); asparagine (-3.5); lysine (-3.9); 
and arginine (^.5). 

It is known in the art that certain amino acids may be substituted by other amino 
acids having a similar hydropathic index or score and still result in a protein with 
similar biological activity, i.e. still obtain a biological functionally equivalent protein. 

15 In making such changes, the substitution of amino acids whose hydropathic indices are 
within +2 is preferred, those within +1 are particularly preferred, and those within ±0.5 
are even more particularly preferred. It is also understood in the art that the substitution 
of like amino acids can be made effectively on the basis of hydrophilicity. U. S. Patent 
4,554,101 (specifically incorporated herein by reference in its entirety), states that the 

20 greatest local average hydrophilicity of a protein, as governed by the hydrophilicity of 
its adjacent amino acids, correlates with a biological property of the protein. 

As detailed in U. S. Patent 4,554,101, the following hydrophilicity values have 
been assigned to amino acid residues: arginine (+3.0); lysine (+3.0); aspartate (+3.0 ± 
1); glutamate (+3.0 + 1); serine (+0.3); asparagine (+0.2); glutamine (+0.2); glycine (0); 

25 threonine (-0.4); proline (-0.5 + 1); alanine (-0.5); histidine (-0.5); cysteine (-1.0); 
methionine (-1.3); valine (-1.5); leucine (-1.8); isoleucine (-1.8); tyrosine (-2.3); 
phenylalanine (-2.5); tryptophan (-3.4). It is understood that an amino acid can be 
substituted for another having a similar hydrophilicity value and still obtain a 
biologically equivalent, and in particular, an immunologically equivalent protein. In 

30 such changes, the substitution of amino acids whose hydrophilicity values are within ±2 
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is preferred, those within ±1 are particularly preferred, and those within ±0.5 are even 
more particularly preferred.. 

As outlined above, amino acid substitutions are generally therefore based on the 
relative similarity of the amino acid side-chain substituents, for example, their 
5 hydrophobicity, hydrophilicity, charge, size, and the like. Exemplary substitutions that 
take various of the foregoing characteristics into consideration are well known to those 
of skill in the art and include: arginine and lysine; glutamate and aspartate; serine and 
threonine; glutamine and asparagine; and valine, leucine and isoleucine. 

In addition, any polynucleotide may be further modified to increase stability in 
10 vivo. Possible modifications include, but are not limited to, the addition of flanking 
sequences at the 5' and/or 3' ends; the use of phosphorothioate or 2* O-methyl rather 
than phosphodiesterase linkages in the backbone; and/or the inclusion of nontraditional 

bases such as inosine, queosine and wybutosine, as well as acetyl- methyl-, thio- and 
other modified forms of adenine, cytidine, guanine, thymine and uridine. 

15 Amino acid substitutions may further be made on the basis of similarity in 

polarity, charge, solubility, hydrophobicity, hydrophilicity and/or the amphipathic 
nature of the residues. For example, negatively charged amino acids include aspartic 
acid and glutamic acid; positively charged amino acids include lysine and arginine; and 
amino acids with uncharged polar head groups having similar hydrophilicity values 

20 include leucine, isoleucine and valine; glycine and alanine; asparagine and glutamine; 
and serine, threonine, phenylalanine and tyrosine. Other groups of amino acids that 
may represent conservative changes include: (l)ala, pro, gly, glu, asp, gin, asn, ser, 
thr; (2) cys, ser, tyr, thr; (3) val, ile, leu, met, ala, phe; (4) lys, arg, his; and (5) phe, tyr, 
trp, his. A variant may also, or alternatively, contain nonconservative changes. In a 

25 preferred embodiment, variant polypeptides differ from a native sequence by 
substitution, deletion or addition of five amino acids or fewer. Variants may also (or 
alternatively) be modified by, for example, the deletion or addition of amino acids that 
have minimal influence on the immunogenicity, secondary structure and hydropathic 
nature of the polypeptide. 

30 As noted above, polypeptides may comprise a signal (or leader) sequence at the 

N-terminal end of the protein, which co-translationally or post-translationally directs 
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transfer of the protein. The polypeptide may also be conjugated to a linker or other 
sequence for ease of synthesis, purification or identification of the polypeptide (e.g., 
poly-His), or to enhance binding of the polypeptide to a solid support. For example, a 
polypeptide may be conjugated to an inununoglobulin Fc region. 

5 When comparing polypeptide sequences, two sequences are said to be 

"identical" if the sequence of amino acids in the two sequences is the same when 
aligned for maximum correspondence, as described below. Comparisons between two 
sequences are typically performed by comparing the sequences over a comparison 
window to identify and compare local regions of sequence similarity. A "comparison 
10 window" as used herein, refers to a segment of at least about 20 contiguous positions, 
usually 30 to about 75, 40 to about 50, in which a sequence may be compared to a 
reference sequence of the same number of contiguous positions after the two sequences 
are optimally aligned. 

Optimal alignment of sequences for comparison may be conducted using the 
15 Megalign program in the Lasergene suite of bioinformatics software (DNASTAR, Inc., 
Madison, WI), using default parameters. This program embodies several alignment 
schemes described in the following references: Dayhoff, M.O. (1978) A model of 
evolutionary change in proteins - Matrices for detecting distant relationships. In 
Dayhoff, M.O, (ed.) Atlas of Protein Sequence and Structure, National Biomedical 
20 Research Foundation, Washington DC Vol. 5, Suppl. 3, pp. 345-358; Hein J. (1990) 
Unified Approach to Alignment and Phylogenes pp. 626-645 Methods in Enzymology 
vol. 183, Academic Press, Inc., San Diego, CA; Higgins, D.G. and Sharp, P.M. (1989) 
CABIOS 5:151-153; Myers, E.W. and MuUer W. (1988) CABIOS 4:11-17; Robinson, 
E.D. (1971) Comb. Theor 11:105; Santou, N. Nes, M. (1987) Mol. Biol. EvoL 4:406- 
25 425; Sneath, P.H.A. and Sokal, R.R. (1973) Numerical Taxonomy - the Principles and 
Practice of Numerical Taxonomy, Freeman Press, San Francisco, CA; Wilbur, W.J. cind 
Lipman, D.J. (1983) Proc. Natl. Acad., Sci. USA 80:726-730. 

Altematively, optimal alignment of sequences for comparison may be 
conducted by the local identity algorithm of Smith and Waterman (1981) Add. APL. 
30 Math 2:482, by the identity alignment algorithm of Needleman and Wunsch (1970) J. 
Mol. Biol. 48:443, by the search for similarity methods of Pearson and Lipman (1988) 
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Proc. Natl. Acad. Sci. USA 85: 2444, by computerized implementations of these 
algorithms (GAP, BESTFIT, BLAST, FASTA, and TFASTA in the Wisconsin 
Genetics Software Package, Genetics Computer Group (GCG), 575 Science Dr., 
Madison, WI), or by inspection. 

5 One preferred example of algorithms that are suitable for determining percent 

sequence identity and sequence similarity are the BLAST and BLAST 2.0 algorithms, 
which are described in Altschul et al. (1977) NucL Acids Res. 25:3389-3402 and 
Altschul et al. (1990) J. Mol. Biol. 215:403-410, respectively. BLAST and BLAST 2.0 
can be used, for example with the parameters described herein, to determine percent 

10 sequence identity for the polynucleotides and polypeptides of the invention. Software 
for performing BLAST analyses is publicly available through the National Center for 
Biotechnology Information. For amino acid sequences, a scoring matrix can be used to 
calculate the cumulative score. Extension of the word hits in each direction are halted 
when: the cumulative alignment score falls off by the quantity X from its maximum 

15 achieved value; the cumulative score goes to zero or below, due to the accumulation of 
one or more negative-scoring residue alignments; or the end of either sequence is 
reached. The BLAST algorithm parameters W, T and X determine the sensitivity and 
speed of the alignment. 

In one preferred approach, the "percentage of sequence identity" is determined 
20 by comparing two optimally aligned sequences over a window of comparison of at least 
20 positions, wherein the portion of the polypeptide sequence in the comparison 
window may comprise additions or deletions (i.e., gaps) of 20 percent or less, usually 5 
to 15 percent, or 10 to 12 percent, as compared to the reference sequences (which does 
not comprise additions or deletions) for optimal alignment of the two sequences. The 
25 percentage is calculated by determining the number of positions at which the identical 
amino acid residue occurs in both sequences to yield the number of matched positions, 
dividing the number of matched positions by the total number of positions in the 
reference sequence (i.e., the window size) and multiplying the results by 100 to yield 
the percentage of sequence identity. 

30 Within other illustrative embodiments, a polypeptide may be a fusion 

polypeptide that comprises multiple polypeptides as described herein, or that comprises 
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at least one polypeptide as described herein and an unrelated sequence, such as a 
known tumor protein. A fusion partner may, for example, assist in providing T helper 
epitopes (an inmiunological fusion partner), preferably T helper epitopes recognized by 
humans, or may assist in expressing the protein (an expression enhancer) at higher 
5 yields than the native recombinant protein. Certain preferred fusion partners are both 
immunological and expression enhancing fusion partners. Other fusion partners may 
be selected so as to increase the solubility of the polypeptide or to enable the 
polypeptide to be targeted to desired intracellular compartments. Still further fusion 
partners include affinity tags, which facilitate purification of the polypeptide. 

10 Fusion polypeptides may generally be prepared using standard techniques, 

including chemical conjugation. Preferably, a fusion polypeptide is expressed as a 
recombinant polypeptide, allowing the production of increased levels, relative to a non- 
fiised polypeptide, in an expression system. Briefly, DNA sequences encoding the 
polypeptide components may be assembled separately, and ligated into an appropriate 

15 expression vector. The 3' end of the DNA sequence encoding one polypeptide 
component is ligated, with or without a peptide linker, to the 5' end of a DNA sequence 
encoding the second polypeptide component so that the reading freunes of the 
sequences are in phase. This permits translation into a single fusion polypeptide that 
retains the biological activity of both component polypeptides. 

20 A peptide linker sequence may be employed to separate the first and second 

polypeptide components by a distance sufficient to ensure that each polypeptide folds 
into its secondary and tertiary structures. Such a peptide linker sequence is 
incorporated into the fusion polypeptide using standard techniques well known in the 
art. Suitable peptide linker sequences may be chosen based on the following factors: 

25 (1) their ability to adopt a flexible extended conformation; (2) their inability to adopt a 
secondary structure that could interact with functional epitopes on the first and second 
polypeptides; and (3) the lack of hydrophobic or charged residues that might react with 
the polypeptide functional epitopes. Preferred peptide linker sequences contain Gly, 
Asn and Ser residues. Other near neutral amino acids, such as Thr and Ala may also be 

30 used in the linker sequence. Amino acid sequences which may be usefully employed as 

linkers include those disclosed in Maratea et al.. Gene 40:39-46, 1985; Murphy et al., 

Proc. Nad. Acad. Sci. USA 83:8258-8262, 1986; U.S. Patent No. 4,935,233 and U.S. 

- 18- 



VU60111 

Patent No. 4,751,180. The linker sequence may generdly be from 1 to about 50 amino 
acids in length. Linker sequences are not required when the first and second 
polypeptides have non-essential N-terminal amino acid regions that can be used to 
separate the functional domains and prevent steric interference. 

5 The ligated DNA sequences are operably linked to suitable transcriptional or 

translational regulatory elements. The regulatory elements responsible for expression 
of DNA are located only 5' to the DNA sequence encoding the first polypeptides. 
Similarly, stop codons required to end translation and tremscription termination signals 
are only present 3 'to the DNA sequence encoding the second polypeptide. 

10 The fusion polypeptide can comprise a polypeptide as described herein together 

with an unrelated inmiunogenic protein, such as an immunogenic protein capable of 
eliciting a recall response. Examples of such proteins include tetanus, tuberculosis and 
hepatitis proteins (see, for example, Stoute et al. New Engl. J. Med., 336:86-91, 1997). 

In one preferred embodiment, the immunological fusion partner is derived from 

15 a Mycobacterium sp., such as a Mycobacterium tuberculosis-derived Ral2 fragment. 
Ral2 compositions and methods for their use in enhancing the expression and/or 
inmiunogenicity of heterologous polynucleotide/polypeptide sequences is described in 
U.S. Patent Application 60/158,585, the disclosure of which is incorporated herein by 
reference in its entirety. Briefly, Ral2 refers to a polynucleotide region that is a 

20 subsequence of a Mycobacterium tuberculosis MTB32A nucleic acid. MTB32A is a 
serine protease of 32 KD molecular weight encoded by a gene in virulent and avirulent 
strains of M. tuberculosis. The nucleotide sequence and amino acid sequence of 
MTB32A have been described (for example, U.S. Patent Application 60/158,585; see 
also, Skeiky et al.. Infection and Immun. (1999) 67:3998-4007, incorporated herein by 

25 reference). C-terminal fragments of the MTB32A coding sequence express at high 
levels and remain as a soluble polypeptides throughout the purification process. 
Moreover, Ral2 may enhance the immunogenicity of heterologous inmiunogenic 
polypeptides with which it is fused. One preferred Ral2 fusion polypeptide comprises 
a 14 KD C-terminal fragment corresponding to amino acid residues 192 to 323 of 

30 MTB32A. Other preferred Ral2 polynucleotides generally comprise at least about 15 
consecutive nucleotides, at least about 30 nucleotides, at least about 60 nucleotides, at 
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least about 100 nucleotides, at least about 200 nucleotides, or at least about 300 
nucleotides that encode a portion of a Ral2 polypeptide. Ral2 polynucleotides may 
comprise a native sequence (i.e., an endogenous sequence that encodes a Ral2 
polypeptide or a portion thereof) or may comprise a variant of such a sequence. Ral2 
5 polynucleotide variants may contain one or more substitutions, additions, deletions 
and/or insertions such that the biological activity of the encoded fusion polypeptide is 
not substantially diminished, relative to a fusion polypeptide comprising a native Ral2 
polypeptide. Variants preferably exhibit at least about 70% identity, more preferably at 
least about 80% identity and most preferably at least about 90% identity to a 
10 polynucleotide sequence that encodes a native Ral2 polypeptide or a portion thereof. 

Within other preferred embodiments, an immunological fusion partner is 
derived from protein D, a surface protein of the gram-negative bacterium Haemophilus 
influenza B (WO 91/18926). Preferably, a protein D derivative comprises 
approximately the first third of the protein (e.g., the first N-terminal 100-110 amino 

15 acids), and a protein D derivative may be lipidated. Within certain preferred 
embodiments, the first 109 residues of a Lipoprotein D fusion partner is included on the 
N-terminus to provide the polypeptide with additional exogenous T-cell epitopes and to 
increase the expression level in E. coli (thus functioning as an expression enhancer). 
The lipid tail ensures optimal presentation of the antigen to antigen presenting cells. 

20 Other fusion partners include the non-structural protein from influenzae virus, NSl 
(hemaglutinin). Typically, the N-terminal 81 amino acids are used, although different 
fragments that include T-helper epitopes may be used. 

In another embodiment, the inmiunological fusion partner is the protein known 
as LYTA, or a portion thereof (preferably a C-terminal portion). LYTA is derived from 

25 Streptococcus pneumoniae, which synthesizes an N-acetyl-L-alanine amidase known as 
amidase LYTA (encoded by the LytA gene; Gene 43:265-292, L986). LYTA is an 
autolysin that specifically degrades certain bonds in the peptidoglycan backbone. The 
C-terminal domain of the LYTA protein is responsible for the affinity to the choline or 
to some choline analogues such as DEAE. This property has been exploited for the 

30 development of E. coli C-LYTA expressing plasmids useful for expression of fusion 

proteins. Purification of hybrid proteins containing the C-LYTA fragment at the amino 

terminus has been described (see Biotechnology 10:795-798, 1992). Within a preferred 
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embodiment, a repeat portion of LYTA may be incorporated into a fusion polypeptide. 
A repeat portion is found in the C-terminal region starting at residue 178. A 
particularly preferred repeat portion incorporates residues 188-305. 

Yet another illustrative embodiment involves fusion polypeptides, and the 
5 polynucleotides encoding them, wherein the fusion partner comprises a targeting signal 
capable of directing a polypeptide to the endosomal/lysosomal compartment, as 
described in U.S. Patent No. 5,633,234. An immunogenic polypeptide of the invention, 
when fused with this targeting signal, will associate more efficiently with MHC class n 
molecules and thereby provide enhanced in vivo stimulation of CD4+ T-cells specific 
10 for the polypeptide. 

The cripto part of the fusion molecule may prefreably be the whole length 188 
aa protein of Cripto 1 or Cripto 3 or a fragment thereof as described herein. 

Polypeptides of the invention are prepared using any of a variety of well known 
synthetic and/or recombinant techniques, the latter of which are further described 

15 below. Polypeptides, portions and other variants generally less than about 150 amino 
acids can be generated by synthetic means, using techniques well known to those of 
ordinary skill in the art. In one illustrative example, such polypeptides are synthesized 
using any of the commercially available solid-phase techniques, such as the Merrifield 
solid-phase synthesis method, where amino acids are sequentially added to a growing 

20 amino acid chain. See Merrifield, J. Am. Chem. Soc. 85:2149-2146, 1963. Equipment 
for automated synthesis of polypeptides is commercially available from suppliers such 
as Perkin Elmer/Applied BioSystems Division (Foster City, CA), and may be operated 
according to the manufacturer's instructions. 

In general, polypeptide compositions (including fusion polj^eptides) of the 
25 invention are isolated. An "isolated" polypeptide is one that is removed from its 
original environment. For example, a naturally-occurring protein or polypeptide is 
isolated if it is separated from some or all of the coexisting materials in the natural 
system. Preferably, such polypeptides are also purified, e.g., are at least about 90% 
pure, more preferably at least about 95% pure and most preferably at least about 99% 
30 pure. 
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Polynucleotide Compositions 

•*Polynucleotide(s)*' generally refers to any polyribonucleotide or 
polydeoxyribonucleotide, that may be unmodified RNA or DNA or modified RNA or 
DNA. "Polynucleotide(s)" include, without limitation, single- and double-stranded 
5 DNA, DNA that is a mixture of single- and double-stranded regions or single-, double- 
and triple-stranded regions, single- and double-stranded RNA, and RNA that is mixture 
of single- and double-stranded regions, hybrid molecules comprising DNA and RNA 
that may be single-stranded or, more typically, double-stranded, or triple-stranded 
regions, or a mixture of single- and double-stranded regions. In addition, 

10 "polynucleotide" as used herein refers to triple-stranded regions comprising RNA or 
DNA or both RNA and DNA. The strands in such regions may be from the same 
molecule or from different molecules. The regions may include all of one or more of 
the molecules, but more typically involve only a region of some of the molecules. One 
of the molecules of a triple-helical region often is an oligonucleotide. As used herein, 

15 the term "polynucleotide(s)" also includes DNAs or RNAs as described above that 
comprise one or more modified bases. Thus, DNAs or RNAs with backbones modified 
for stability or for other reasons are "polynucleotide(s)" as that term is intended herein. 
Moreover, DNAs or RNAs comprising unusual bases, such as inosine, or modified 
bases, such as tritylated bases, to name just two examples, are polynucleotides as the 

20 term is used herein. It will be appreciated that a great variety of modifications have 
been made to DNA and RNA that serve many useful purposes known to those of skill 
in the art. The term "polynucleotide(s)" as it is employed herein embraces such 
chemically, enzymatically or metabolically modified forms of polynucleotides, as well 
as the chemical forms of DNA and RNA characteristic of viruses and cells, including, 

25 for example, simple and complex cells. "Polynucleotide(s)" also embraces short 
polynucleotides often referred to as oligonucleotide(s). 

The present invention, in other aspects, provides polynucleotide compositions 
that encode for the polypeptides of the invention. The terms "DNA" and 
"polynucleotide" are used essentially interchangeably herein to refer to a DNA 
30 molecule that has been isolated free of total genomic DNA of a particular species. 
"Isolated," as used herein, means that a polynucleotide is substantially away from other 
coding sequences, and that the DNA molecule does not contain large portions of 

-22- 



VU60111 



unrelated coding DNA, such as large chromosomal fragments or other functional genes 
or polypeptide coding regions. Of course, this refers to the DNA molecule as originally 
isolated, and does not exclude genes or coding regions later added to the segment by 
the hand of man. 

5 As will be understood by those skilled in the art, the polynucleotide 

compositions of this invention can include genomic sequences, extra-genomic and 
plasmid-encoded sequences and smaller engineered gene segments that express, or may 
be adapted to express, proteins, polypeptides, peptides and the like. Such segments 
may be naturally isolated, or modified synthetically by the hand of man. 

10 As will be also recognized by the skilled artisan, polynucleotides of the 

invention may be single-stranded (coding or antisense) or double-stranded, and may be 
DNA (genomic, cDNA or synthetic) or RNA molecules. RNA molecules may include 
HnRNA molecules, which contain introns and correspond to a DNA molecule in a one- 
to-one manner, and mRNA molecules, which do not contain introns. Additional coding 

15 or non-coding sequences may, but need not, be present within a polynucleotide of the 
present invention, and a polynucleotide may, but need not, be linked to other molecules 
and/or support materials. 

Polynucleotides may comprise a native sequence (i.e., an endogenous sequence 
that encodes a polypeptide/protein of the invention or a portion thereof) or may 
20 comprise a sequence that encodes a variant or derivative, preferably and immunogenic 
variant or derivative, of such a sequence. 

Typically, polynucleotide variants will contain one or more substitutions, 
additions, deletions and/or insertions, preferably such that the immunogenicity of the 
polj^eptide encoded by the variant polynucleotide is not substantially diminished 
25 relative to a poljrpeptide encoded by a polynucleotide sequence specifically set forth 
herein). The term "variants" should also be understood to encompasses homologous 
genes of xenogenic origin. 

In additional embodiments, the present invention provides polynucleotide 
fragments comprising various lengths of contiguous stretches of sequence identical to 
30 or complementary to one or more of the SEQ ID NO:l or 3. For example, 
polynucleotides are provided by this invention that comprise at least about 10, 15, 20, 
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30, 40, 50, 75, 100, 150, 200, 300, 400, the SEQ ID NO: 1 as well as all intermediate 
lengths there between. It will be readily understood that "intermediate lengths", in this 
context, means any length between the quoted values, such as 16, 17, 18, 19, etc.; 21, 
22, 23, etc.; 30, 31, 32, etc.; 50, 51, 52, 53, etc.; 100, 101, 102, 103, etc.; 150, 151, 152, 
5 153, etc.; including all integers through 200-500; 500-1,000, and the like. Particularly 
preferred polynucleotides are those which encode the epitopes as set forth in SEQ ID 
NOS: 13-94. 

In another embodiment of the invention, polynucleotide compositions are 
provided that are capable of hybridizing under moderate to high stringency conditions 

10 to a polynucleotide sequence provided herein, or a fragment thereof, or a 
complementary sequence thereof. Hybridization techniques are well known in the art 
of molecular biology. For purposes of illustration, suitable moderately stringent 
conditions for testing the hybridization of a polynucleotide of this invention with other 
polynucleotides include prewashing in a solution of 5 X SSC, 0.5% SDS, 1.0 mM 

15 EDTA (pH 8.0); hybridizing at 50''C-60''C, 5 X SSC, overnight; followed by washing 
twice at 65''C for 20 minutes with each of 2X, 0.5X and 0.2X SSC containing 0.1% 
SDS. One skilled in the art will understand that the stringency of hybridization can be 
readily manipulated, such as by altering the salt content of the hybridization solution 
and/or the temperature at which the hybridization is performed. For example, in 

20 another embodiment, suitable highly stringent hybridization conditions include those 
described above, with the exception that the temperature of hybridization is increased, 
e.g., to 60-65oC or 65-70oC. 

In certain preferred embodiments, the polynucleotides described above, e.g., 
polynucleotide variants, fragments and hybridizing sequences, encode polypeptides that 
25 are immunologically cross-reactive with a polypeptide sequence specifically set forth in 
SEQ ID NO: 1 or SEQ ID NO:3. In other preferred embodiments, such polynucleotides 
encode polypeptides that have a level of immunogenic activity of at least about 50%, 
preferably at least about 70%, and more preferably at least about 90% of that for a 
polypeptide sequence specifically set forth herein. 

30 The polynucleotides of the present invention, or fragments thereof, regardless of 

the length of the coding sequence itself, may be combined with other DNA sequences, 
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such as promoters, polyadenylation signals, additional restriction enzyme sites, multiple 
cloning sites, other coding segments, and the like, such that their overall length may 
vary considerably. It is therefore contemplated that a nucleic acid fragment of almost 
any length may be employed, with the total length preferably being limited by the ease 
5 of preparation and use in the intended recombinant DNA protocol. For example, 
illustrative polynucleotide segments with total lengths of about 10,000, about 5000, 
about 3000, about 2,000, about 1,000, about 500, about 200, about 100, about 50 base 
pairs in length, and the like, (including all intermediate lengths) are contemplated to be 
useful in many implementations of this invention. 

10 When comparing polynucleotide sequences, two sequences are said to be 

"identical" if the sequence of nucleotides in the two sequences is the same when 
aligned for maximum correspondence, as described below. Comparisons between two 
sequences are typically performed by comparing the sequences over a comparison 
window to identify and compare local regions of sequence similarity. A "comparison 

15 window" as used herein, refers to a segment of at least about 20 contiguous positions, 
usually 30 to about 75, 40 to about 50, in which a sequence may be compared to a 
reference sequence of the same number of contiguous positions after the two sequences 
are optimally aligned. 

Optimal alignment of sequences for comparison may be conducted using the 

20 Megalign program in the Lasergene suite of bioinformatics software (DNASTAR, Inc., 

Madison, WI), using default parameters. This program embodies several alignment 

schemes described in the following references: Dayhoff, M.O. (1978) A model of 

evolutionary change in proteins - Matrices for detecting distant relationships. In 

Dayhoff, M.O. (ed.) Atlas of Protein Sequence and Structure, National Biomedical 

25 Research Foundation, Washington DC Vol. 5, Suppl. 3, pp. 345-358; Hein J. (1990) 

Unified Approach to Alignment and Phylogenes pp. 626-645 Methods in Enzymology 

vol. 183, Academic Press, Inc., San Diego, CA; Higgins, D.G. and Sharp, P.M. (1989) 

CABIOS 5:151-153; Myers, E.W. and Muller W. (1988) CABIOS 4:11-17; Robinson, 

E.D. (1971) Comb. Theor 11:105; Santou, N. Nes, M. (1987) Mol. Biol. Evol. 4:406- 

30 425; Sneath, P.H.A. and Sokal, R.R. (1973) Numerical Taxonomy - the Principles and 

Practice of Numerical Taxonomy, Freeman Press, San Francisco, CA; Wilbur, W.J. and 

Lipman, D.J. (1983) Proc. Natl. Acad., Sci. USA 80:726-730. 
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Alternatively, optimal alignment of sequences for comparison may be 
conducted by the local identity algorithm of Smith and Waterman (1981) Add. APL. 
Math 2:482, by the identity alignment algorithm of Needleman and Wunsch (1970) J. 
Mol. Biol. 48:443, by the search for similarity methods of Pearson and Lipman (1988) 
5 Proc. Natl. Acad. Sci. USA 85: 2444, by computerized implementations of these 
algorithms (GAP, BESTFIT, BLAST, FASTA, and TFASTA in the Wisconsin 
Genetics Software Package, Genetics Computer Group (GCG), 575 Science Dr., 
Madison, WI), or by inspection. 

One preferred example of algorithms that are suitable for determining percent 

10 sequence identity and sequence similarity are the BLAST and BLAST 2.0 algorithms, 
which are described in Altschul et al. (1977) Nucl. Acids Res. 25:3389-3402 and 
Altschul et al. (1990) J. Mol. Biol. 215:403-410, respectively. BLAST and BLAST 2.0 
can be used, for example with the parameters described herein, to determine percent 
sequence identity for the polynucleotides of the invention. Software for performing 

15 BLAST analyses is publicly available through the National Center for Biotechnology 
Information. In one illustrative example, cumulative scores can be calculated using, for 
nucleotide sequences, the parameters M (reward score for a pair of matching residues; 
always >0) and N (penalty score for mismatching residues; always <0). Extension of 
the word hits in each direction are halted when: the cumulative alignment score falls off 

20 by the quantity X from its maximum achieved value; the cumulative score goes to zero 
or below, due to the accumulation of one or more negative-scoring residue alignments; 
or the end of either sequence is reached. The BLAST algorithm parameters W, T and 
X determine the sensitivity and speed of the alignment. The BLASTN program (for 
nucleotide sequences) uses as defaults a wordlength (W) of 11, and expectation (E) of 

25 10, and the BLOSUM62 scoring matrix (see Henikoff and Henikoff (1989) Proc. Natl. 
Acad. Sci. USA 89:10915) alignments, (B) of 50, expectation (E) of 10, M=5, N=-4 
and a comparison of both strands. 

^Preferably, the "percentage of sequence identity" is determined by comparing 

two optimally aligned sequences over a window of comparison of at least 20 positions, 

30 wherein the portion of the polynucleotide sequence in the comparison window may 

comprise additions or deletions (i.e., gaps) of 20 percent or less, usually 5 to 15 percent, 

or 10 to 12 percent, as compared to the reference sequences (which does not comprise 
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additions or deletions) for optimal alignment of the two sequences. The percentage is 
calculated by determining the number of positions at which the identical nucleic acid 
bases occurs in both sequences to yield the number of matched positions, dividing the 
number of matched positions by the total number of positions in the reference sequence 
5 (i.e., the window size) and multiplying the results by 100 to yield the percentage of 
sequence identity. 

It will be appreciated by those of ordinary skill in the art that, as a result of the 
degeneracy of the genetic code, there are many nucleotide sequences that encode a 
polypeptide as described herein. Some of these polynucleotides bear minimal 

10 homology to the nucleotide sequence of any native gene. Nonetheless, polynucleotides 
that vary due to differences in codon usage are specifically contemplated by the present 
invention. Further, alleles of the genes comprising the polynucleotide sequences 
provided herein are within the scope of the present invention. Alleles are endogenous 
genes that are altered as a result of one or more mutations, such as deletions, additions 

15 and/or substitutions of nucleotides. The resulting mRNA and protein may, but need 
not, have an altered structure or function. Alleles may be identified using standard 
techniques (such as hybridization, amplification and/or database sequence comparison). 

Therefore, in another embodiment of the invention, a mutagenesis approach, 
such as site-specific mutagenesis, is employed for the preparation of immunogenic 

20 variants and/or derivatives of the polypeptides described herein. By this approach, 
specific modifications in a polypeptide sequence can be made through mutagenesis of 
the underlying polynucleotides that encode them. These techniques provides a 
straightforward approach to prepare and test sequence variants, for example, 
incorporating one or more of the foregoing considerations, by introducing one or more 

25 nucleotide sequence changes into the polynucleotide. 

Site-specific mutagenesis allows the production of mutants through the use of 
specific oligonucleotide sequences which encode the DNA sequence of the desired 
mutation, as well as a sufficient number of adjacent nucleotides, to provide a primer 
sequence of sufficient size and sequence complexity to form a stable duplex on both 
30 sides of the deletion junction being traversed. Mutations may be employed in a 
selected polynucleotide sequence to improve, alter, decrease, modify, or otherwise 
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change the properties of the polynucleotide itself, and/or alter the properties, activity, 
composition, stability, or primary sequence of the encoded polypeptide. 

In certain embodiments of the present invention, the inventors contemplate the 
mutagenesis of the disclosed polynucleotide sequences to alter one or more properties 
5 of the encoded polypeptide, such as the immunogenicity of a polypeptide vaccine. The 
techniques of site-specific mutagenesis are well-known in the £irt, and are widely used 
to create variants of both polypeptides and polynucleotides. For example, site-specific 
mutagenesis is often used to alter a specific portion of a DNA molecule. In such 
embodiments, a primer comprising typically about 14 to about 25 nucleotides or so in 
10 length is employed, with about 5 to about 10 residues on both sides of the junction of 
the sequence being altered. 

As will be appreciated by those of skill in the art, site-specific mutagenesis 
techniques have often employed a phage vector that exists in both a single stranded and 
double stranded form. Typical vectors useful in site-directed mutagenesis include 
15 vectors such as the M13 phage. These phage are readily commercially-available and 
their use is generally well-known to those skilled in the art. Double-stranded plasmids 
are also routinely employed in site directed mutagenesis that eliminates the step of 
transferring the gene of interest from a plasmid to a phage. 

In general, site-directed mutagenesis in accordance herewith is performed by 
20 first obtaining a single-stranded vector or melting apart of two strands of a 
double-stranded vector that includes within its sequence a DNA sequence that encodes 
the desired peptide. An oligonucleotide primer bearing the desired mutated sequence is 
prepared, generally synthetically. This primer is then annealed with the single-stranded 
vector, and subjected to DNA polymerizing enzymes such as E, coli polymerase I 
25 Klenow fragment, in order to complete the synthesis of the mutation-bearing strand. 
Thus, a heteroduplex is formed wherein one strand encodes the original non-mutated 
sequence and the second strand bears the desired mutation. This heteroduplex vector is 
then used to transform appropriate cells, such as E. coli cells, and clones are selected 
which include recombinant vectors bearing the mutated sequence arrangement. 

30 The preparation of sequence variants of the selected peptide-encoding DNA 

segments using site-directed mutagenesis provides a means of producing potentially 
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useful species and is not meant to be limiting as there are other ways in which sequence 
variants of peptides and the DNA sequences encoding them may be obtained. For 
example, recombinant vectors encoding the desired peptide sequence may be treated 
with mutagenic agents, such as hydroxylamine, to obtain sequence variants. Specific 
5 details regarding these methods and protocols are found in the teachings of Maloy 
etal., 1994; Segal, 1976; Prokop and Bajpai, 1991; Kuby, 1994; and Maniatis et al., 
1982, each incorporated herein by reference, for that purpose. 

As used herein, the term "oligonucleotide directed mutagenesis procedure" 
refers to template-dependent processes and vector-mediated propagation which result in 

10 an increase in the concentration of a specific nucleic acid molecule relative to its initial 
concentration, or in an increase in the concentration of a detectable signal, such as 
amplification. As used herein, the term "oligonucleotide directed mutagenesis 
procedure" is intended to refer to a process that involves the template-dependent 
extension of a primer molecule. The term template dependent process refers to nucleic 

15 acid synthesis of an RNA or a DNA molecule wherein the sequence of the newly 
synthesized strand of nucleic acid is dictated by the well-known rules of 
complementary base pairing (see, for example, Watson, 1987). Typically, vector 
mediated methodologies involve the introduction of the nucleic acid fragment into a 
DNA or RNA vector, the clonal amplification of the vector, and the recovery of the 

20 amplified nucleic acid fragment. Examples of such methodologies are provided by U. 
S. Patent No. 4,237,224, specifically incorporated herein by reference in its entirety. 

In another approach for the production of polypeptide variants of the present 
invention, recursive sequence recombination, as described in U.S. Patent No. 
5,837,458, may be employed. In this approach, iterative cycles of recombination and 
25 screening or selection are performed to "evolve" individual polynucleotide variants of 
the invention having, for example, enhanced immunogenic activity. 

According to another embodiment of the present invention, polynucleotide 
compositions comprising antisense oligonucleotides are provided. Antisense 
oligonucleotides have been demonstrated to be effective and targeted inhibitors of 
30 protein synthesis, and, consequently, provide a therapeutic approach by which a disease 
can be treated by inhibiting the synthesis of proteins that contribute to the disease. The 
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efficacy of antisense oligonucleotides for inhibiting protein synthesis is well 
established. For example, the synthesis of polygalactauronase and the muscarine type 2 
acetylcholine receptor are inhibited by antisense oligonucleotides directed to their 
respective mRNA sequences (U.S. Patent 5,739,119 and U.S. Patent 5,759,829). 
5 Further, examples of antisense inhibition have been demonstrated with the nuclear 
protein cyclin, the multiple drug resistance gene (MDGl), ICAM-1, E-selectin, STK-1, 
striatal GABAA receptor and human EOF (Jaskulski et al.. Science. 1988 Jun 
10;240(4858): 1544-6; Vasanthakumar and Ahmed, Cancer Commun. 1989;1(4):225- 
32; Peris et al.. Brain Res Mol Brain Res. 1998 Jun 15;57(2):3 10-20; U. S, Patent 
10 5,801,154; U.S. Patent 5,789,573; U. S. Patent 5,718,709 and U.S. Patent 5,610,288). 
Antisense constructs have also been described that inhibit and can be used to treat a 
variety of abnormal cellular proliferations, e.g. cancer (U.S. Patent 5,747,470; U.S. 
Patent 5,591,317 and U.S. Patent 5,783,683). 

Therefore, in certain embodiments, the present invention provides 
15 oligonucleotide sequences that comprise all, or a portion of, any sequence that is 
capable of specifically binding to polynucleotide sequence described herein, or a 
complement thereof. In one embodiment, the antisense oligonucleotides comprise 
DNA or derivatives thereof. In another embodiment, the oligonucleotides comprise 
RNA or derivatives thereof. In a third embodiment, the oligonucleotides are modified 
20 DNAs comprising a phosphorothioated modified backbone. In a fourth embodiment, 
the oligonucleotide sequences comprise peptide nucleic acids or derivatives thereof. In 
each case, preferred compositions comprise a sequence region that is complementary, 
and more preferably substantially-complementary, and even more preferably, 
completely complementary to one or more portions of polynucleotides disclosed herein. 

25 Selection of antisense compositions specific for a given gene sequence is based 

upon analysis of the chosen target sequence (i.e. in these illustrative examples the rat 
and human sequences) and determination of secondary structure, Tm, binding energy, 
relative stability, and antisense compositions were selected based upon their relative 
inability to form dimers, hairpins, or other secondary structures that would reduce or 

30 prohibit specific binding to the target mRNA in a host cell. 
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Highly preferred target regions of the mRNA, are those which are at or near the 
AUG translation initiation codon, and those sequences which are substantially 
complementary to 5' regions of the mRNA. These secondary structure analyses and 
target site selection considerations can be performed, for example, using v.4 of the 
5 OLIGO primer analysis software and/or the BLASTN 2.0.5 algorithm software 
(Altschul et al„ Nucleic Acids Res. 1997 Sep l;25(17):3389-402). 

The use of an antisense delivery method employing a short peptide vector, 
termed MPG (27 residues), is also contemplated. The MPG peptide contains a 
hydrophobic domain derived from the fusion sequence of HIV gp41 and a hydrophilic 

10 domain from the nuclear localization sequence of SV40 T-antigen (Morris et al.. 
Nucleic Acids Res. 1997 Jul 15;25(14):2730-6). It has been demonstrated that several 
molecules of the MPG peptide coat the antisense oligonucleotides and can be delivered 
into cultured mammalian cells in less than 1 hour with relatively high efficiency (90%). 
Further, the interaction with MPG strongly increases both the stability of the 

15 oligonucleotide to nuclease and the ability to cross the plasma membrane. 

According to another embodiment of the invention, the polynucleotide 
compositions described herein are used in the design and preparation of ribozyme 
molecules for inhibiting expression of the tumor polypeptides and proteins of the 
present invention in tumor cells. Ribozymes are RNA-protein complexes that cleave 

20 nucleic acids in a site-specific fashion. Ribozymes have specific catalytic domains that 
possess endonuclease activity (Kim and Cech, Proc Natl Acad Sci USA. 1987 
Dec;84(24):8788-92; Forster and Symons, Cell. 1987 Apr 24;49(2):2 11-20). For 
example, a large number of ribozymes accelerate phosphoester transfer reactions with a 
high degree of specificity, often cleaving only one of several phosphoesters in an 

25 oligonucleotide substrate (Cech et al.. Cell. 1981 Dec;27(3 Pt 2):487-96; Michel and 
Westhof, J Mol BioL 1990 Dec 5;216(3):585-610; Reinhold-Hurek and Shub, Nature. 
1992 May 14;357(6374): 173-6). This specificity has been attributed to the requirement 
that the substrate bind via specific base -pairing interactions to the internal guide 
sequence ("IGS") of the ribozyme prior to chemical reaction. 

30 Six basic varieties of naturally-occurring enzymatic RNAs are known presently. 

Each can catalyze the hydrolysis of RNA phosphodiester bonds in trans (and thus can 
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cleave other RNA molecules) under physiological conditions. In general, enzymatic 
nucleic acids act by first binding to a target RNA. Such binding occurs through the 
target binding portion of a enzymatic nucleic acid which is held in close proximity to 
an enzymatic portion of the molecule that acts to cleave the target RNA. Thus, the 
5 enzymatic nucleic acid first recognizes and then binds a target RNA through 
complementary base-pairing, and once bound to the correct site, acts enzymatically to 
cut the target RNA. Strategic cleavage of such a target RNA will destroy its ability to 
direct synthesis of an encoded protein. After an enzymatic nucleic acid has bound and 
cleaved its RNA target, it is released from that RNA to search for another target and 
10 can repeatedly bind and cleave new targets. 

The enzymatic nature of a ribozyme is advantageous over many technologies, 
such as antisense technology (where a nucleic acid molecule simply binds to a nucleic 
acid target to block its translation) since the concentration of ribozyme necessary to 
affect a therapeutic treatment is lower than that of an antisense oligonucleotide. This 

15 advantage reflects the ability of the ribozyme to act enzymatically. Thus, a single 
ribozyme molecule is able to cleave many molecules of target RNA. In addition, the 
ribozyme is a highly specific inhibitor, with the specificity of inhibition depending not 
only on the base pairing mechanism of binding to the target RNA, but also on the 
mechanism of target RNA cleavage. Single mismatches, or base-substitutions, near the 

20 site of cleavage can completely eliminate catalytic activity of a ribozyme. Similar 
mismatches in antisense molecules do not prevent their action (Woolf et al., Proc Natl 
Acad Sci USA. 1992 Aug 15;89(16):7305-9). Thus, the specificity of action of a 
ribozyme is greater than that of an antisense oligonucleotide binding the same RNA 
site. 

25 The enzymatic nucleic acid molecule may be formed in a hanmierhead, hairpin, 

a hepatitis 5 virus, group I intron or RNaseP RNA (in association with an RNA guide 
sequence) or Neurospora VS RNA motif. Examples of hammerhead motifs are 
described by Rossi et al. Nucleic Acids Res. 1992 Sep 1 1;20(17):4559-65. Examples 
of hairpin motifs are described by Hampel et al. (Eur. Pat. Appl. Publ. No. EP 

30 0360257), Hampel and Tritz, Biochemistry 1989 Jun 13;28(12):4929-33; Hampel et al.. 
Nucleic Acids Res. 1990 Jan 25;18(2):299-304 and U. S. Patent 5,631,359. An 
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example of the hepatitis 5 viras motif is described by Perrotta and Been, Biochemistry. 
1992 Dec 1;31(47): 11843-52; an example of the RNaseP motif is described by 
Guerrier-Takada et al.. Cell. 1983 Dec;35(3 Ft 2):849-57; Neurospora VS RNA 
ribozyme motif is described by Collins (Saville and Collins, Cell. 1990 May 
5 18;61(4):685-96; Saville and Collins, Proc Natl Acad Sci USA. 1991 Oct 
l;88(19):8826-30; Collins and Olive, Biochemistry. 1993 Mar 23;32(1 1):2795-9); and 
an example of the Group I intron is described in (U. S. Patent 4,987,071). All that is 
important in an enzymatic nucleic acid molecule of this invention is that it has a 
specific substrate binding site which is complementary to one or more of the target 
10 gene RNA regions, and that it have nucleotide sequences within or surrounding that 
substrate binding site which impart an RNA cleaving activity to the molecule. Thus the 
ribozyme constructs need not be limited to specific motifs mentioned herein. 

Ribozymes may be designed as described in Int. Pat. Appl. Publ. No. WO 
93/23569 and Int. Pat. Appl. Publ. No. WO 94/02595, each specifically incorporated 
15 herein by reference) and synthesized to be tested in vitro and in vivo, as described. 
Such ribozymes can also be optimized for delivery. While specific examples are 
provided, those in the art will recognize that equivalent RNA targets in other species 
can be utilized when necessary. 

Ribozyme activity can be optimized by altering the length of the ribozyme 
20 binding arms, or chemically synthesizing ribozymes with modifications that prevent 
their degradation by serum ribonucleases (see e.g.. Int. Pat. Appl. Publ. No. WO 
92/07065; Int. Pat. Appl. Publ. No. WO 93/15187; Int. Pat. Appl. Publ. No. WO 
91/03162; Eur. Pat. Appl. Publ. No. 92110298.4; U. S. Patent 5,334,711; and Int. Pat. 
Appl. Publ. No. WO 94/13688, which describe various chemical modifications that can 
25 be made to the sugar moieties of enzymatic RNA molecules), modifications which 
enhance their efficacy in cells, and removal of stem II bases to shorten RNA synthesis 
times and reduce chemical requirements: 

Sullivan et al. (Int. Pat. Appl. Publ. No. WO 94/02595) describes the general 
methods for delivery of enzymatic RNA molecules. Ribozymes may be administered 
30 to cells by a variety of methods known to those familiar to the art, including, but not 
restricted to, encapsulation in liposomes, by iontophoresis, or by incorporation into 
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other vehicles, such as hydrogels, cyclodextrins, biodegradable nanocapsules, and 
bioadhesive microspheres. For some indications, ribozymes may be directly delivered 
ex vivo to cells or tissues with or without the aforementioned vehicles. Alternatively, 
the RNA/vehicle combination may be locally delivered by direct inhalation, by direct 
5 injection or by use of a catheter, infusion pump or stent. Other routes of delivery 
include, but are not limited to, intravascular, intramuscular, subcutaneous or joint 
injection, aerosol inhalation, oral (tablet or pill form), topical, systemic, ocular, 
intraperitoneal and/or intrathecal delivery. More detailed descriptions of ribozyme 
delivery and administration are provided in Int. Pat. Appl. Publ. No. WO 94/02595 and 
10 Int. Pat. Appl. Publ. No. WO 93/23569, each specifically incorporated herein by 
reference. 

Another means of accumulating high concentrations of a ribozyme(s) within 
cells is to incorporate the ribozyme-encoding sequences into a DNA expression vector. 
Transcription of the ribozyme sequences are driven from a promoter for eukaryotic 

15 RNA polymerase I (pol I), RNA polymerase II (pol II), or RNA polymerase III (pol 
III). Transcripts from pol II or pol HI promoters will be expressed at high levels in all 
cells; the levels of a given pol II promoter in a given cell type will depend on the nature 
of the gene regulatory sequences (enhancers, silencers, etc.) present nearby. 
Prokaryotic RNA polymerase promoters , may also be used, providing that the 

20 prokaryotic RNA polymerase enzyme is expressed in the appropriate cells Ribozymes 
expressed from such promoters have been shown to function in mammalian cells. Such 
transcription units can be incorporated into a variety of vectors for introduction into 
mammalian cells, including but not restricted to, plasmid DNA vectors, viral DNA 
vectors (such as adenovims or adeno-associated vectors), or viral RNA vectors (such as 

25 retroviral, semliki forest virus, sindbis virus vectors). 

In another embodiment of the invention, peptide nucleic acids (PNAs) 

compositions are provided. PNA is a DNA mimic in which the nucleobases are 

attached to a pseudopeptide backbone (Good and Nielsen, Antisense Nucleic Acid 

Drug Dev. 1997 7(4) 431-37). PNA is able to be utilized in a number methods that 

30 traditionally have used RNA or DNA. Often PNA sequences perform better in 

techniques than the corresponding RNA or DNA sequences and have utilities that are 

not inherent to RNA or DNA. A review of PNA including methods of making, 
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characteristics of, and methods of using, is provided by Corey {Trends Biotechnol 1997 
Jun;15(6):224-9). As such, in certain embodiments, one may prepare PNA sequences 
that are complementary to one or more portions of the ACE mRNA sequence, and such 
PNA compositions may be used to regulate, alter, decrease, or reduce the translation of 
5 ACE-specific mRNA, and thereby alter the level of ACE activity in a host cell to which 
such PNA compositions have been administered. 

PNAs have 2-aminoethyl-glycine linkages replacing the normal phosphodiester 
backbone of DNA (Nielsen et al.. Science 1991 Dec 6;254(5037): 1497-500; Hanvey et 
al., Science. 1992 Nov 27;258(5087): 1481-5; Hyrup and Nielsen, Bioorg Med Chem. 

10 1996 Jan;4(l):5-23). This chemistry has three important consequences: firstly, in 
contrast to DNA or phosphorothioate oligonucleotides, PNAs are neutral molecules; 
secondly, PNAs are achiral, which avoids the need to develop a stereoselective 
synthesis; and thirdly, PNA synthesis uses standard Boc or Fmoc protocols for solid- 
phase peptide synthesis, although other methods, including a modified Merrifield 

1 5 method, have been used. 

PNA monomers or ready-made oligomers are conmiercially available from 
PerSeptive Biosystems (Framingham, MA). PNA syntheses by either Boc or Fmoc 
protocols are straightforward using manual or automated protocols (Norton et al., 
Bioorg Med Chem. 1995 Apr;3(4):437-45). The manual protocol lends itself to the 
20 production of chemically modified PNAs or the simultaneous synthesis of families of 
closely related PNAs. 

As with peptide synthesis, the success of a particular PNA synthesis will depend 
on the properties of the chosen sequence. For example, while in theory PNAs can 
incorporate any combination of nucleotide bases, the presence of adjacent purines can 

25 lead to deletions of one or more residues in the product. In expectation of this 
difficulty, it is suggested that, in producing PNAs with adjacent purines, one should 
repeat the coupling of residues likely to be added inefficiently. This should be 
followed by the purification of PNAs by reverse-phase high-pressure liquid 
chromatography, providing yields and purity of product similar to those observed 

30 during the synthesis of peptides. 
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Modifications of PNAs for a given application may be accomplished by 
coupling amino acids during solid-phase synthesis or by attaching compounds that 
contain a carboxylic acid group to the exposed N-terminal amine. Alternatively, PNAs 
can be modified after synthesis by coupling to an introduced lysine or cysteine. The 
5 ease with which PNAs can be modified facilitates optimization for better solubility or 
for specific functional requirements. Once synthesized, the identity of PNAs and their 
derivatives can be confirmed by mass spectrometry. Several studies have made and 
utilized modifications of PNAs (for example, Norton et al., Bioorg Med Chem. 1995 
Apr;3(4):437-45; Petersen et al., J Pept Sci. 1995 May-Jun; 1(3): 175-83; Orum et al., 

10 Biotechniques. 1995 Sep;19(3):472-80; Footer et al., Biochemistry, 1996 Aug 
20;35(33): 10673-9; Griffith et al.. Nucleic Acids Res. 1995 Aug 1 1;23(15):3003-8; 
Pardridge et al., Proc Natl Acad Sci USA. 1995 Jun 6;92(12):5592-6; Boffa et al., 
Proc Natl Acad Sci USA. 1995 Mar 14;92(6): 1901-5; Gambacorti-Passerini et al., 
Blood. 1996 Aug 15;88(4):1411-7; Armitage et al., Proc Natl Acad Sci USA. 1997 

15 Nov 11;94(23): 12320-5; Seeger et al., Biotechniques. 1997 Sep;23(3):512-7). U.S. 
Patent No. 5,700,922 discusses PNA-DNA-PNA chimeric molecules and their uses in 
diagnostics, modulating protein in organisms, and treatment of conditions susceptible to 
therapeutics. 

Methods of characterizing the antisense binding properties of PNAs are 
20 discussed in Rose (Anal Chem. 1993 Dec 15;65(24):3545-9) and Jensen et al. 
(Biochemistry. 1997 Apr 22;36(16):5072-7). Rose uses capillary gel electrophoresis to 
determine binding of PNAs to their complementary oligonucleotide, measuring the 
relative binding kinetics and stoichiometry. Similar types of measurements were made 
by Jensen et al. using BIAcore™ technology. 

25 Other applications of PNAs that have been described and will be apparent to the 

skilled artisan include use in DNA strand invasion, antisense inhibition, mutational 
analysis, enhancers of transcription, nucleic acid purification, isolation of 
transcriptionally active genes, blocking of transcription factor binding, genome 
cleavage, biosensors, in situ hybridization, and the like. 
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Polynucleotide Characterization and Expression 

Polynucleotides compositions of the present invention may be prepared and/or 
manipulated using any of a variety of well established techniques (see generally, 
Sambrook et al.. Molecular Cloning: A Laboratory Manual, Cold Spring Harbor 
5 Laboratories, Cold Spring Harbor, NY, 1989, and other like references). 

In other embodiments of the invention, polynucleotide sequences or fragments 
thereof which encode polypeptides of the invention, or fusion proteins or functional 
equivalents thereof, may be used in recombinant DNA molecules to direct expression 
of a polypeptide in appropriate host cells. Due to the inherent degeneracy of the 
10 genetic code, other DNA sequences that encode substantially the same or a functionally 
equivalent amino acid sequence may be produced and these sequences may be used to 
clone and express a given polypeptide. 

As will be understood by those of skill in the art, it may be advsintageous in 
some instances to produce polypeptide-encoding nucleotide sequences possessing non- 
15 naturally occurring codons. For example, codons preferred by a particular prokaryotic 
or eukaryotic host can be selected to increase the rate of protein expression or to 
produce a recombinant RNA transcript having desirable properties, such as a half-life 
which is longer than that of a transcript generated from the naturally occurring 
sequence. 

20 Moreover, the polynucleotide sequences of the present invention can be 

engineered using methods generally known in the art in order to alter polypeptide 
encoding sequences for a variety of reasons, including but not limited to, alterations 
which modify the cloning, processing, and/or expression of the gene product. For 
example, DNA shuffling by random fragmentation and PCR reassembly of gene 

25 fragments and synthetic oligonucleotides may be used to engineer the nucleotide 
sequences. In addition, site-directed mutagenesis may be used to insert new restriction 
sites, alter glycosylation pattems, change codon preference, produce splice variants, or 
introduce mutations, and so forth. 

In another embodiment of the invention, natural, modified, or recombinant 
30 nucleic acid sequences may be ligated to a heterologous sequence to encode a fusion 
protein. For example, to screen peptide libraries for inhibitors of polypeptide activity, it 
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may be useful to encode a chimeric protein that can be recognized by a commercially 
available antibody. A fusion protein may also be engineered to contain a cleavage site 
located between the polypeptide-encoding sequence and the heterologous protein 
sequence, so that the polypeptide may be cleaved and purified away from the 
5 heterologous moiety. 

Sequences encoding a desired polypeptide may be synthesized, in whole or in 
part, using chemical methods well known in the art (see Caruthers, M. H. et al. (1980) 
Nucl Acids Res. Symp, Sen 215-223, Horn, T. et al (1980) NucL Acids Res. Symp. Sen 
225-232). Alternatively, the protein itself may be produced using chemical methods to 
10 synthesize the amino acid sequence of a polypeptide, or a portion thereof. For example, 
peptide synthesis can be performed using various solid-phase techniques (Roberge, J. 
Y. et al. (1995) Science 269:202-204) and automated synthesis may be achieved, for 
example, using the ABI 431 A Peptide Synthesizer (Perkin Elmer, Palo Alto, CA). 

A newly synthesized peptide may be substantially purified by preparative high 
15 performance liquid chromatography (e.g., Creighton, T. (1983) Proteins, Structures and 
Molecular Principles, WH Freeman and Co., New York, N.Y.) or other comparable 
techniques available in the art. The composition of the synthetic peptides may be 
confirmed by amino acid analysis or sequencing (e.g., the Edman degradation 
procedure). Additionally, the amino acid sequence of a polypeptide, or any part thereof, 
20 may be altered during direct synthesis and/or combined using chemical methods with 
sequences from other proteins, or any part thereof, to produce a variant polj^eptide. 

In order to express a desired polypeptide, the nucleotide sequences encoding 
the polypeptide, or functional equivalents, may be inserted into appropriate expression 
vector, i.e., a vector which contains the necessary elements for the transcription and 

25 translation of the inserted coding sequence. Methods which are well known to those 
skilled in the art may be used to construct expression vectors containing sequences 
encoding a polypeptide of interest and appropriate transcriptional and translational 
control elements. These methods include in vitro recombinant DNA techniques, 
synthetic techniques, and in vivo genetic recombination. Such techniques are 

30 described, for example, in Sambrook, J. et al. (1989) Molecular Cloning, A Laboratory 
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Manual, Cold Spring Harbor Press, Plainview, N.Y., and Ausubel, F. M. et al. (1989) 
Current Protocols in Molecular Biology, John Wiley & Sons, New York. N. Y. 

A variety of expression vector/host systems may be utilized to contain and 
express polynucleotide sequences. These include, but are not limited to, 
5 microorganisms such as bacteria transformed with recombinant bacteriophage, 
plasmid, or cosmid DNA expression vectors; yeast transformed with yeast expression 
vectors; insect cell systems infected with virus expression vectors (e.g., baculovirus); 
plant cell systems transformed with virus expression vectors (e.g., cauliflower mosaic 
virus, CaMV; tobacco mosaic virus, TMV) or with bacterial expression vectors (e.g., 
10 Ti or pBR322 plasmids); or animal cell systems. 

The "control elements" or "regulatory sequences" present in an expression 
vector are those non-translated regions of the vector—enhancers, promoters, 5' and 3' 
untranslated regions—which interact with host cellular proteins to carry out 
transcription and translation. Such elements may vary in their strength and specificity. 

15 Depending on the vector system and host utilized, any number of suitable transcription 
and translation elements, including constitutive and inducible promoters, may be used. 
For example, when cloning in bacterial systems, inducible promoters such as the 
hybrid lacZ promoter of the PBLUESCRIPT phagemid (Stratagene, La Jolla, Calif.) or 
PSPORTl plasmid (Gibco BRL, Gaithersburg, MD) and the like may be used. In 

20 manmialian cell systems, promoters from mammalian genes or from mammalian 
viruses are generally preferred. If it is necessary to generate a cell line that contains 
multiple copies of the sequence encoding a polypeptide, vectors based on SV40 or 
EBV may be advantageously used with an appropriate selectable marker. 

In bacterial systems, any of a number of expression vectors may be selected 
25 depending upon the use intended for the expressed polypeptide. For example, when 
large quantities are needed, for example for the induction of antibodies, vectors which 
direct high level expression of fusion proteins that are readily purified may be used. 
Such vectors include, but are not limited to, the multifunctional E. coli cloning and 
expression vectors such as BLUESCRIPT (Stratagene), in which the sequence 
30 encoding the polypeptide of interest may be ligated into the vector in frame with 
sequences for the amino-terminal Met and the subsequent 7 residues of .beta.- 
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galactosidase so that a hybrid protein is produced; pIN vectors (Van Heeke, G. and S. 
M. Schuster (1989) 7. BioL Chem. 254:5503-5509); and the like. pGEX Vectors 
(Promega, Madison, Wis.) may also be used to express foreign polypeptides as fusion 
proteins with glutathione S-transferase (GST). In general, such fusion proteins are 
5 soluble and can easily be purified from lysed cells by adsorption to glutathione-agarose 
beads followed by elution in the presence of free glutathione. Proteins made in such 
systems may be designed to include heparin, thrombin, or factor XA protease cleavage 
sites so that the cloned polypeptide of interest can be released from the GST moiety at 
will. 

10 In the yeast, Saccharomyces cerevisiae, a number of vectors containing 

constitutive or inducible promoters such as alpha factor, alcohol oxidase, and PGH 
may be used. For reviews, see Ausubel et al. (supra) and Grant et al. (1987) Methods 
EnzymoL 153:516-544. 

In cases where plant expression vectors are used, the expression of sequences 
15 encoding polypeptides may be driven by any of a number of promoters. For example, 
vired promoters such as the 35S and 19S promoters of CaMV may be used alone or in 
combination with the omega leader sequence from TMV (Takamatsu, N. (1987) EMBO 
J. 6:307-311. Alternatively, plant promoters such as the small subunit of RUBISCO or 
heat shock promoters may be used (Coruzzi, G. et al. (1984) EMBO 7. 3:1671-1680; 
20 Broglie, R. et al. (1984) Science 224:838-843; and Winter, J. et al. (1991) Results 
Probl. Cell Differ, 77:85-105). These constructs can be introduced into plant cells by 
direct DNA transformation or pathogen-mediated transfection. Such techniques are 
described in a number of generally available reviews (see, for example, Hobbs, S. or 
Murry, L. E. in McGraw Hill Yearbook of Science and Technology (1992) McGraw 
25 Hill, New York, N.Y.; pp. 191-196). 

An insect system may also be used to express a polypeptide of interest. For 
example, in one such system, Autographa califomica nuclear polyhedrosis virus 
(AcNPV) is used as a vector to express foreign genes in Spodoptera frugiperda cells or 
in Trichoplusia larvae. The sequences encoding the polypeptide may be cloned into a 
30 non-essential region of the virus, such as the polyhedrin gene, and placed under control 
of the polyhedrin promoter. Successful insertion of the polypeptide-encoding sequence 
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will render the polyhedrin gene inactive and produce recombinant virus lacking coat 
protein. The recombinant viruses may then be used to infect, for example, S. frugiperda 
cells or Trichoplusia larvae in which the polypeptide of interest may be expressed 
(Engelhard, E. K. et al. (1994) Proc. Natl Acad. ScL 91 :3224-3227). 

5 In mammalian host cells, a number of viral-based expression systems are 

generally available. For example, in cases where an adenovirus is used as an expression 
vector, sequences encoding a polypeptide of interest may be ligated into an adenovirus 
transcription/translation complex consisting of the late promoter and tripartite leader 
sequence. Insertion in a non-essential El or E3 region of the viral genome may be used 
10 to obtain a viable virus which is capable of expressing the polypeptide in infected host 
cells (Logan, J. and Shenk, T. (1984) Proc. Natl Acad. ScL 57:3655-3659). In addition, 
transcription enhancers, such as the Rous sarcoma virus (RSV) enhancer, may be used 
to increase expression in mammalian host cells. 

Specific initiation signals may also be used to achieve more efficient translation 
15 of sequences encoding a polypeptide of interest. Such signals include the ATG 
initiation codon and adjacent sequences. In cases where sequences encoding the 
polypeptide, its initiation codon, and upstream sequences are inserted into the 
appropriate expression vector, no additional transcriptional or translational control 
signals may be needed. However, in cases where only coding sequence, or a portion 
20 thereof, is inserted, exogenous translational control signals including the ATG 
initiation codon should be provided. Furthermore, the initiation codon should be in the 
correct reading frame to ensure translation of the entire insert. Exogenous translational 
elements and initiation codons may be of various origins, both natural and synthetic. 
The efficiency of expression may be enhanced by the inclusion of enhancers which are 
25 appropriate for the particular cell system which is used, such as those described in the 
literature (Scharf, D. et al. (1994) Results ProbL Cell Differ. 20: 125-162). . 

In addition, a host cell strain may be chosen for its ability to modulate the 
expression of the inserted sequences or to process the expressed protein in the desired 
fashion. Such modifications of the polypeptide include, but are not limited to, 
30 acetylation, carboxylation. glycosylation, phosphorylation, lipidation, and acylation. 
Post-translational processing which cleaves a "prepro" form of the protein may also be 
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used to facilitate correct insertion, folding and/or function. Different host cells such as 
CHO, COS, HeLa, MDCK, HEK293, and WI38, which have specific cellular 
machinery and characteristic mechanisms for such post-translational activities, may be 
chosen to ensure the correct modification and processing of the foreign protein. 

5 For long-term, high-yield production of recombinant proteins, stable expression 

is generally preferred. For example, cell lines which stably express a polynucleotide of 
interest may be transformed using expression vectors which may contain viral origins 
of replication and/or endogenous expression elements and a selectable marker gene on 
the same or on a separate vector. Following the introduction of the vector, cells may be 
10 allowed to grow for 1-2 days in an enriched media before they are switched to selective 
media. The purpose of the selectable marker is to confer resistance to selection, and its 
presence allows growth and recovery of cells which successfully express the 
introduced sequences. Resistant clones of stably transformed cells may be proliferated 
using tissue culture techniques appropriate to the cell type. 

15 Any number of selection systems may be used to recover transformed cell lines. 

These include, but are not limited to, the herpes simplex virus thymidine kinase 
(Wigler, M. et al. (1977) Cell 77:223-32) and adenine phosphoribosyltransferase 
(Lowy, I. et al. (1990) Cell 22:817-23) genes which can be employed in tk.sup.- or 
aprt.sup.- cells, respectively. Also, antimetabolite, antibiotic or herbicide resistance can 

20 be used as the basis for selection; for example, dhfr which confers resistance to 
methotrexate (Wigler, M. et al. (1980) Proc, Natl Acad. ScL 77:3567-70); npt, which 
confers resistance to the aminoglycosides, neomycin and G-418 (Colbere-Garapin, F. 
et al (1981) J. MoL BioL 750:1-14); and als or pat, which confer resistance to 
chlorsulfuron and phosphinotricin acetyltransferase, respectively (Murry, supra). 

25 Additional selectable genes have been described, for example, trpB, which allows cells 
to utilize indole in place of tryptophan, or hisD, which allows cells to utilize histinol in 
place of histidine (Hartman, S. C. and R. C. Mulligan (1988) Proc, Natl Acad. Scl 
85:8047-51). The use of visible markers has gained popularity with such markers as 
anthocyanins, beta-glucuronidase and its substrate GUS, and luciferase and its 

30 substrate luciferin, being widely used not only to identify transformants, but also to 

quantify the amount of transient or stable protein expression attributable to a specific 

vector system (Rhodes, C. A. et al. (1995) Methods Mol Biol 55:121-131). 
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Although the presence/absence of marker gene expression suggests that the 
gene of interest is also present, its presence and expression may need to be confirmed. 
For example, if the sequence encoding a polypeptide is inserted within a marker gene 
sequence, recombinant cells containing sequences can be identified by the absence of 
5 marker gene function. Alternatively, a marker gene can be placed in tandem with a 
polypeptide-encoding sequence under the control of a single promoter. Expression of 
the mcu-ker gene in response to induction or selection usually indicates expression of 
the tandem gene as well. 

Alternatively, host cells that contain and express a desired polynucleotide 
10 sequence may be identified by a variety of procedures known to those of skill in the art. 
These procedures include, but are not limited to, DNA-DNA or DNA-RNA 
hybridizations and protein bioassay or immunoassay techniques which include, for 
example, membrane, solution, or chip based technologies for the detection and/or 
quantification of nucleic acid or protein. 

15 Host cells transformed with a polynucleotide sequence of interest may be 

cultured under conditions suitable for the expression and recovery of the protein from 
cell culture. The protein produced by a recombinant cell may be secreted or contained 
intracellularly depending on the sequence and/or the vector used. As will be understood 
by those of skill in the art, expression vectors containing polynucleotides of the 

20 invention may be designed to contain signal sequences which direct secretion of the 
encoded polypeptide through a prokaryotic or eukaryotic cell membrane. Other 
recombinant constructions may be used to join sequences encoding a polypeptide of 
interest to nucleotide sequence encoding a polypeptide domain which will facilitate 
purification of soluble proteins. Such purification facilitating domains include, but are 

25 not limited to, metal chelating peptides such as histidine-tryptophan modules that allow 
purification on immobilized metals, protein A domains that allow purification on 
immobilized immunoglobulin, and the domain utilized in the FLAGS 
extension/affinity purification system (Immunex Corp., Seattle, Wash.). The inclusion 
of cleavable linker sequences such as those specific for Factor XA or enterokinase 

30 (Invitrogen. San Diego, Calif.) between the purification domain and the encoded 

polypeptide may be used to facilitate purification. One such expression vector provides 

for expression of a fusion protein containing a polypeptide of interest and a nucleic 
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acid encoding 6 histidine residues preceding a thioredoxin or an enterokinase cleavage 
site. The histidine residues facilitate purification on IMIAC (immobilized metal ion 
affinity chromatography) as described in Porath, J. et al. (1992, Prot. Exp. Purif, 3:263- 
281) while the enterokinase cleavage site provides a means for purifying the desired 
5 polypeptide from the fusion protein. A discussion of vectors which contain fusion 
proteins is provided in KroU, D. J. et al. (1993; DMA Cell Biol. 72:441-453). 

In addition to recombinant production methods, polypeptides of the invention, 
and fragments thereof, may be produced by direct peptide synthesis using solid-phase 
techniques (Merrifield J. (1963) J. Am. Chem. Soc. 55:2149-2154). Protein synthesis 
10 may be performed using manual techniques or by automation. Automated synthesis 
may be achieved, for example, using Applied Biosystems 431 A Peptide Synthesizer 
(Perkin Elmer). Altematively, various fragments may be chemicedly synthesized 
separately and combined using chemical methods to produce the full length molecule. 

15 T Cells Compositions 

The present invention, in another aspect, provides T cells specific for a tumor 
polypeptide disclosed herein, or for a variant or derivative thereof. Such cells may 
generally be prepared in vitro or ex vivo, using standard procedures. For example, T 
cells may be isolated from bone marrow, peripheral blood, or a fraction of bone marrow 
20 or peripheral blood of a patient, using a commercially available cell separation system, 
such as the Isolex™ System, available from Nexell Therapeutics, Inc. (Irvine, CA; see 
also U.S. Patent No. 5,240,856; U.S. Patent No. 5,215,926; WO 89/06280; WO 
91/16116 and WO 92/07243). Altematively, T cells may be derived from related or 
unrelated humans, non-human manmials, cell lines or cultures. 

25 T cells may be stimulated with a polypeptide, polynucleotide encoding a 

polypeptide and/or an antigen presenting cell (APC) that expresses such a polypeptide. 
Such stimulation is performed under conditions and for a time sufficient to permit the 
generation of T cells that are specific for the polypeptide of interest. Preferably, a 
tumor polypeptide or polynucleotide of the invention is present within a delivery 

30 vehicle, such as a microsphere, to facilitate the generation of specific T cells. 
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T cells are considered to be specific for a polypeptide of the present invention if 
the T cells specifically proliferate, secrete cytokines or kill target cells coated with the 
polypeptide or expressing a gene encoding the polypeptide. T cell specificity may be 
evaluated using any of a variety of standard techniques. For example, within a 
5 chromium release assay or proliferation assay, a stimulation index of more than two 
fold increase in lysis and/or proliferation, compared to negative controls, indicates T 
cell specificity. Such assays may be performed, for example, as described in Chen et 
al.. Cancer Res, 54:1065-1070, 1994. Alternatively, detection of the proliferation of 
T cells may be accomplished by a variety of known techniques. For example, T cell 

10 proliferation can be detected by measuring an increased rate of DNA synthesis {e.g., by 
pulse-labeling cultures of T cells with tritiated thymidine and measuring the amount of 
tritiated thymidine incorporated into DNA). Contact with a tumor polypeptide (100 
ng/ml - 100 |lg/ml, preferably 200 ng/ml - 25 |Llg/ml) for 3 - 7 days will typically result 
in at least a two fold increase in proliferation of the T cells. Contact as described above 

15 for 2-3 hours should result in activation of the T cells, as measured using standard 
cytokine assays in which a two fold increase in the level of cytokine release (e.g., TNF 
or IFN-y) is indicative of T cell activation {see Coligan et al.. Current Protocols in 
Immunology, vol. 1, Wiley Interscience (Greene 1998)). T cells that have been 
activated in response to a tumor polypeptide, polynucleotide or polypeptide-expressing 

20 APC may be CD4"*" and/or CD8"^. Tumor polypeptide-specific T cells may be expanded 
using standard techniques. Within preferred embodiments, the T cells are derived from 
a patient, a related donor or an unrelated donor, and are administered to the patient 
following stimulation and expansion. 

For therapeutic purposes, CD4+ or CD8+ T cells that proliferate in response to 
25 a tumor polypeptide, polynucleotide or APC can be expanded in number either in vitro 
or in vivo. Proliferation of such T cells in vitro may be accomplished in a variety of 
ways. For example, the T cells can be re-exposed to a tumor polypeptide, or a short 
peptide corresponding to an immunogenic portion of such a polypeptide, with or 
without the addition of T cell growth factors, such as interleukin-2, and/or stimulator 
30 cells that synthesize a tumor polypeptide. Alternatively, one or more T cells that 
proliferate in the presence of the tumor polypeptide can be expanded in number by 
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cloning. Methods for cloning cells are well known in the art, and include limiting 
dilution. 

Pharmaceutical Compositions 

5 In additional embodiments, the present invention concerns formulation of one 

or more of the polynucleotide, polypeptide or T-cell disclosed herein in 
pharmaceutically-acceptable solutions for administration to a cell or an animal, either 
alone, or in combination with one or more other modalities of therapy. In particular, 
the present invention concerns, the use of cripto polynucleotides, polypeptides, 
10 fragments, fusions and variants in a pharmaceutical composition for the treatment of 
tumours. In particular it is preferred that the Cripto is Cripto 1. 

It will be understood that, if desired, a composition as disclosed herein may be • 
administered in combination with other agents as well, such as, e.g., other proteins or 
polypeptides or various pharmaceutically-active agents. In fact, there is virtually no 

15 limit to other components that may also be included, given that the additional agents do 
not cause a significant adverse effect upon contact with the target cells or host tissues. 
The compositions may thus be delivered along with various other agents as required in 
the particular instance. Such compositions may be purified from host cells or other 
biological sources, or alternatively may be chemically synthesized as described herein. 

20 Likewise, such compositions may further comprise substituted or derivatized RNA or 
DNA compositions. 

Therefore, in another aspect of the present invention, pharmaceutical 
compositions are provided comprising one or more of the polynucleotide, polypeptide 
and/or T-cell compositions described herein in combination with a physiologically 

25 acceptable carrier. In certain preferred embodiments, the pharmaceutical compositions 
of the invention comprise inmiunogenic polynucleotide and/or polypeptide 
compositions of the invention for use in prophylactic and theraputic vaccine 
applications. Vaccine preparation is generally described in, for example, M,F, Powell 
and M.J. Newman, eds., "Vaccine Design (the subunit and adjuvant approach)," 

30 Plenum Press (NY, 1995). Generally, such compositions will comprise one or more 
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polynucleotide and/or polypeptide compositions of the present invention in 
combination with one or more immunostimulants. 

It will be apparent that any of the pharmaceutical compositions described herein 
can contain pharmaceutically acceptable salts of the polynucleotides and polypeptides 
5 of the invention. Such salts can be prepared, for example, from pharmaceutically 
acceptable non-toxic bases, including organic bases {e.g., salts of primary, secondary 
and tertiary amines and basic amino acids) and inorganic bases {e.g., sodium, 
potassium, lithium, ammonium, calcium and magnesium salts). 

In another embodiment, illustrative immunogenic compositions, e.g., vaccine 
10 compositions, of the present invention comprise DNA encoding one or more of the 
polypeptides as described above, such that the polypeptide is generated in situ. As 
noted above, the polynucleotide may be administered within any of a variety of 
delivery systems known to those of ordinary skill in the art. Indeed, numerous gene 
delivery techniques are well known in the art, such as those described by Rolland, Crit. 
15 Rev. Therap. Drug Carrier Systems 75:143-198, 1998, and references cited therein. 
Appropriate polynucleotide expression systems will, of course, contain the necessary 
regulatory DNA regulatory sequences for expression in a patient (such as a suitable 
promoter and terminating signal). Alternatively, bacterial delivery systems may 
involve the administration of a bacterium (such as Bacillus-Calmette-Guerrin) that 
20 expresses an inmiunogenic portion of the polypeptide on its cell surface or secretes 
such an epitope. 

Therefore, in certain embodiments, polynucleotides encoding immunogenic 
polypeptides described herein are introduced into suitable mammalian host cells for 
expression using any of a number of known viral-based systems. In one illustrative 

25 embodiment, retroviruses provide a convenient and effective platform for gene delivery 
systems. A selected nucleotide sequence encoding a polypeptide of the present 
invention can be inserted into a vector and packaged in retroviral particles using 
techniques known in the art. The recombinant virus can then be isolated and delivered 
to a subject. A number of illustrative retroviral systems have been described (e.g., U.S. 

30 Pat. No. 5,219,740; Miller and Rosman (1989) BioTechniques 7:980-990; Miller, A. D. 
(1990) Human Gene Therapy 1:5-14; Scarpa et al. (1991) Virology 180:849-852; Bums 
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et al. (1993) Proc. Natl. Acad. Sci. USA 90:8033-8037; and Boris-Lawrie and Temin 
(1993) Cur. Opin. Genet. Develop. 3:102-109. 

In addition, a number of illustrative adenovirus-based systems have also been 
described. Unlike retroviruses which integrate into the host genome, adenoviruses 
5 persist extrachromosomally thus minimizing the risks associated with insertional 
mutagenesis (Haj-Ahmad and Graham (1986) J. Virol. 57:267-274; Bett et al. (1993) J. 
Virol. 67:591 1-5921; Mittereder et al. (1994) Human Gene Therapy 5:717-729; Seth et 
al. (1994) J. ViroL 68:933-940; Barr et al. (1994) Gene Therapy 1:51-58; Berkner, K. 
L. (1988) BioTechniques 6:616-629; and Rich et al. (1993) Human Gene Therapy 
10 4:461-476). 

Various adeno-associated virus (AAV) vector systems have also been 
developed for polynucleotide delivery. AAV vectors can be readily constructed using 
techniques well known in the art. See, e.g., U.S. Pat. Nos. 5,173,414 and 5,139,941; 
Intemational Publication Nos. WO 92/01070 and WO 93/03769; Lebkowski et al. 
15 (1988) Molec. Cell. Biol. 8:3988-3996; Vincent et al. (1990) Vaccines 90 (Cold Spring 
Harbor Laboratory Press); Carter, B. J. (1992) Current Opinion in Biotechnology 
3:533-539; Muzyczka, N. (1992) Current Topics in Microbiol, and Immunol. 158:97- 
129; Kotin, R. M. (1994) Human Gene Therapy 5:793-801; Shelling and Smith (1994) 
Gene Therapy 1:165-169; and Zhou et al. (1994) J. Exp. Med. 179:1867-1875. 

20 Additional viral vectors useful for delivering the nucleic acid molecules 

encoding polypeptides of the present invention by gene transfer include those derived 
from the pox family of viruses, such as vaccinia virus and avian poxvirus. By way of 
example, vaccinia virus recombinants expressing the novel molecules can be 
constructed as follows. The DNA encoding a polypeptide is first inserted into an 

25 appropriate vector so that it is adjacent to a vaccinia promoter and flanking vaccinia 
DNA sequences, such as the sequence encoding thymidine kinase (TK). This vector is 
then used to transfect cells which are simultaneously infected with vaccinia. 
Homologous recombination serves to insert the vaccinia promoter plus the gene 
encoding the polypeptide of interest into the viral genome. The resulting TK.sup.(-) 

30 recombinant can be selected by culturing the cells in the presence of 5- 
bromodeoxjoiridine and picking viral plaques resistant thereto. 
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A vaccinia-based infection/transfection system can be conveniently used to 
provide for inducible, transient expression or coexpression of one or more polypeptides 
described herein in host cells of an organism. In this particular system, cells are first 
infected in vitro with a vaccinia virus recombinant that encodes the bacteriophage T7 
5 RNA polymerase. This polymerase displays exquisite specificity in that it only 
transcribes templates bearing T7 promoters. Following infection, cells are transfected 
with the polynucleotide or polynucleotides of interest, driven by a T7 promoter. The 
polymerase expressed in the cytoplasm from the vaccinia virus recombinant transcribes 
the transfected DNA into RNA which is then translated into polypeptide by the host 
10 translational machinery. The method provides for high level, transient, cytoplasmic 
production of large quantities of RNA and its translation products. See, e.g., Elroy- 
Stein and Moss, Proc. Natl. Acad. Sci. USA (1990) 87:6743-6747; Fuerst et al. Proc. 
Natl. Acad. Sci. USA (1986) 83:8122-8126. 

Alternatively, avipoxviruses, such as the fowlpox and canarypox viruses, can 
15 also be used to deliver the coding sequences of interest. Recombinant avipox viruses, 
expressing immunogens from mammalian pathogens, are known to confer protective 
immunity when administered to non-avian species. The use of an Avipox vector is 
particularly desirable in human and other mammalian species since members of the 
Avipox genus can only productively replicate in susceptible avian species and therefore 
20 are not infective in mammalian cells. Methods for producing recombinant 
Avipoxviruses are known in the art and employ genetic recombination, as described 
above with respect to the production of vaccinia viruses. See, e.g., WO 91/12882; WO 
89/03429; and WO 92/03545. 

Any of a number of alphavirus vectors can also be used for delivery of 
25 polynucleotide compositions of the present invention, such as those vectors described 
in U.S. Patent Nos. 5,843,723; 6,015,686; 6,008,035 and 6,015,694. Certain vectors 
based on Venezuelan Equine Encephalitis (VEE) can also be used, illustrative examples 
of which can be found in U.S. Patent Nos. 5,505,947 and 5,643,576. 

Moreover, molecular conjugate vectors, such as the adenovirus chimeric vectors 
30 described in Michael et al. J. Biol. Chem. (1993) 268:6866-6869 and Wagner et al. 
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Proc. Natl. Acad. Sci. USA (1992) 89:6099-6103, can also be used for gene delivery 
under the invention. 

Additional illustrative information on these and other known viral-based 
delivery systems can be found, for example, in Fisher-Hoch et al., Proc. Natl, Acad, 
5 Sci, USA 55:317-321, 1989; Flexner et al., Ann, N,Y. Acad, Sci. 559:86-103, 1989; 
Flexner et al.. Vaccine 5:17-21, 1990; U.S. Patent Nos. 4,603,1 12, 4,769,330, and 
5,017,487; WO 89/01973; U.S. Patent No. 4,777,127; GB 2,200,651; EP 0,345,242; 
WO 91/02805; Berkner, Biotechniques 5:616-627, 1988; Rosenfeld et al.. Science 
252:431-434, 1991; KoUs et al., Proc. Natl Acad, Sci. USA 97:215-219, 1994; 
10 Kass-Eisler et al., Proc. Natl. Acad. Sci. USA 90:11498-11502, 1993; Guzman et al.. 
Circulation 55:2838-2848, 1993; and Guzman et al., Cir. Res. 73:1202-1207, 1993. 

In another embodiment of the invention, a polynucleotide is 
administered/delivered as "naked" DNA, for example as described in Ulmer et al.. 
Science 259:1745-1749, 1993 and reviewed by Cohen, Science 259:1691-1692, 1993. 
15 The uptake of naked DNA may be increased by coating the DNA onto biodegradable 
beads, which are efficiently transported into the cells. 

In still another embodiment, a composition of the present invention can be 
delivered via a particle bombardment approach, many of which have been described. 
In one illustrative example, gas-driven particle acceleration can be achieved with 

20 devices such as those manufactured by Powderject Pharmaceuticals PLC (Oxford, UK) 
and Powderject Vaccines Inc. (Madison, WI), some examples of which are described in 
U.S. Patent Nos. 5,846,796; 6,010,478; 5,865,796; 5,584,807; and EP Patent No. 0500 
799. This approach offers a needle-free delivery approach wherein a dry powder 
formulation of microscopic particles, such as polynucleotide or polypeptide particles, 

25 are accelerated to high speed within a helium gas jet generated by a hand held device, 
propelling the particles into a target tissue of interest. The particles, when delivering 
nucleic acid are preferably gold beads of a 0.4 - 4.0 um, more preferably 0.6 - 2.0 um 
diameter and the DNA conjugate coated onto these and then encased in a cartridge for 
placing into the "gene gun". The particles are typically and preferably delivered to the 

30 skin. Other means of delivery to the skin, comprise utilising needle delivery via a 
needle of a liquid formulation. 
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DN A vaccines usually consist of a bacterial plasmid vector into which is 
inserted a strong, normally viral, promoter, the gene of interest which encodes for an 
antigenic peptide and a polyadenylation/transcriptional termination sequences. Thus 
gene of interest may encode a full cripto protein as describedor simply an antigenic 
5 peptide sequence such as described in seq ID no 13 -94. The plasmid can be grown in 
bacteria, such as for example E.coli and then isolated and prepared in an appropriate 
medium, depending upon the intended route of administration, before being 
administered to the host. Following administration the plasmid is taken up by cells of 
the host where the encoded peptide is produced. The plasmid vector will preferably be 

10 made without an origin of replication which is functional in eukaryotic cells, in order to 
prevent plasmid replication in the mammalian host and integration within chromosomal 
DNA of the animal concerned. 

There are a number of advantages of DNA vaccination relative to traditional 
vaccination techniques. First, it is predicted that because of the proteins which are 

15 encoded by the DNA sequence are synthesised in the host, the structure or 

conformation of the protein will be similar to the native protein associated with the 
disease state. It is also likely that DNA vaccination will offer protection against 
different strains of a virus, by generating cytotoxic T lymphocyte response that 
recognise epitopes from conserved proteins. Furthermore, because the plasmids are 

20 taken up by the host cells where antigenic protein can be produced, a long-lasting 
inmiune response will be elicited. The technology also offers the possibility of 
combing diverse immunogens/epitopes into a single preparation . 

Helpful background information in relation to DNA vaccination is provided in 
Donnelly et al "DNA vaccines" Ann. Rev Immunol. 1997 15: 617-648, the disclosure 

25 of which is included herein in its' entirety by way of reference. 

In a related embodiment, other devices and methods that may be useful for gas- 
driven needle-less injection of compositions of the present invention include those 
provided by Bioject, Inc. (Portland, OR), some examples of which are described in U.S. 
Patent Nos. 4,790,824; 5,064,413; 5,312,335; 5,383,851; 5,399,163; 5,520,639 and 

30 5,993,412. 

According to another embodiment, the pharmaceutical compositions described 
herein will comprise one or more inununostimulants in addition to the immunogenic 
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polynucleotide, polypeptide, T-cell and/or APC compositions of this invention. An 
inimunostimulant refers to essentially any substance that enhances or potentiates an 
immune response (antibody and/or cell-mediated) to an exogenous antigen. One 
preferred type of immunostimulant comprises an adjuvant. Many adjuvants contain a 
5 substance designed to protect the antigen from rapid catabolism, such as aluminum 
hydroxide or mineral oil, and a stimulator of inmiune responses, such as lipid A, 
Bordatella pertussis or Mycobacterium tuberculosis derived proteins. Certain 
adjuvants are commercially available as, for example, Freund's Incomplete Adjuvant 
and Complete Adjuvant (Difco Laboratories, Detroit, MI); Merck Adjuvant 65 (Merck 

10 and Company, Inc., Rahway, NJ); AS-2 (SmithKline Beecham, Philadelphia, PA); 
aluminum salts such as aluminum hydroxide gel (alum) or aluminum phosphate; salts 
of calcium, iron or zinc; an insoluble suspension of acylated tyrosine; acylated sugars; 
cationically or anionically derivatized polysaccharides; polyphosphazenes; 
biodegradable microspheres; monophosphoryl lipid A and quil A. Cytokines, such as 

15 GM-CSF, interleukin-2, -7, -12, and other like growth factors, may also be used as 
adjuvants. 

Within certain embodiments of the invention, the adjuvant composition is 
preferably one that induces an immune response predominantly of the Thl type. High 
levels of Thl -type cytokines {e.g., IFN-y, TNFa, IL-2 and IL-12) tend to favor the 

20 induction of cell mediated immune responses to an administered antigen. In contrast, 
high levels of Th2-type cytokines {e.g., IL-4, IL-5, IL-6 and IL-10) tend to favor the 
induction of humoral inmiune responses. Following application of a vaccine as 
provided herein, a patient will support an immune response that includes Thl- and Th2- 
type responses. Within a preferred embodiment, in which a response is predominantly 

25 Thl -type, the level of Thl -type cytokines will increase to a greater extent than the level 
of Th2-type cytokines. The levels of these C3^okines may be readily assessed using 
standard assays. For a review of the families of cj^okines, see Mosmann and Coffman, 
Ann. Rev. Immunol. 7:145-173, 1989. 

Certain adjuvants for eliciting a predominantly Thl -type response include, for 
30 example, a combination of monophosphoryl lipid A, preferably 3-de-O-acylated 
monophosphoryl lipid A, together with an aluminum salt. MPL® adjuvants are 
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available from Corixa Coq^oration (Seattle, WA; see^ for example, US Patent Nos. 
4,436,727; 4,877,611; 4,866,034 and 4,912,094). CpG-containing oligonucleotides (in 
which the CpG dinucleotide is unmethylated) also induce a predominantly Thl 
response. Such oligonucleotides are well known and are described, for example, in 
5 WO 96/02555, WO 99/33488 and U.S. Patent Nos. 6,008,200 and 5,856,462. 
Immunostimulatory DNA sequences are also described, for example, by Sato et al.. 
Science 275:352, 1996. Another preferred adjuvant comprises a saponin, such as Quil 
A, or derivatives thereof, including QS21 and QS7 (Aquila Biopharmaceuticals Inc., 
Framingham, MA); Escin; Digitonin; or Gypsophila or Chenopodium quinoa saponins . 
10 Other preferred formulations include more than one saponin in the adjuvant 
combinations of the present invention, for example combinations of at least two of the 
following group comprising QS21, QS7, Quil A, (J-escin, or digitonin. 

Alternatively the saponin formulations may be combined with vaccine vehicles 
composed of chitosan or other polycationic polymers, polylactide and polylactide-co- 

15 glycolide particles, poly-N-acetyl glucosamine-beised polymer matrix, particles 

composed of polysaccharides or chemically modified polysaccharides, liposomes and 
lipid-based particles, particles composed of glycerol monoesters, etc. The saponins may 
also be formulated in the presence of cholesterol to form particulate structures such as 
liposomes or ISCOMs. Furthermore, the saponins may be formulated together with a 

20 polyoxyethylene ether or ester, in either a non -particulate solution or suspension, or in a 
particulate structure such as a paucilamelar liposome or IS COM. The saponins may 
also be formulated with excipients such as Carbopol^ to increase viscosity, or may be 
formulated in a dry powder form with a powder excipient such as lactose. 

In one preferred embodiment, the adjuvant system includes the combination of a 
25 monophosphoryl lipid A and a saponin derivative, such as the combination of QS21 
and 3D-MPL adjuvant, as described in WO 94/00153, or a less reactogenic 
composition where the QS21 is quenched with cholesterol, as described in WO 
96/33739. Other preferred formulations comprise an oil-in-water emulsion and 
tocopherol. Another particularly preferred adjuvant formulation employing QS21, 3D- 
30 MPL® adjuvant and tocopherol in an oil-in-water emulsion is described in WO 
95/17210. 
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Another enhanced adjuvant system involves the combination of a CpG- 
containing oligonucleotide and a saponin derivative particularly the combination of 
CpG and QS21 as disclosed in WO 00/09159. Preferably the formulation additionally 
comprises an oil in water emulsion and tocopherol. 
5 Additional illustrative adjuvants for use in the pharmaceutical compositions of 

the invention include Montanide ISA 720 (Seppic, France), SAP (Chiron, California, 
United States), ISCOMS (CSL), MF-59 (Chiron), the SBAS series of adjuvants (e.g., 
SBAS-2 or SBAS-4, available from SmithKline Beecham, Rixensart, Belgium), Detox 
(Enhanzyn®) (Corixa, Hamilton, MT), RC-529 (Corixa, Hamilton, MT) and other 
10 aminoalkyl glucosaminide 4-phosphates (AGPs), such as those described in pending 
U.S. Patent Application Serial Nos. 08/853,826 and 09/074,720, the disclosures of 
which arc incorporated herein by reference in their entireties, and polyoxyethylene 
ether adjuvants such as those described in WO 99/52549A1. 

Other preferred adjuvants include adjuvant molecules of the general formula (I): 

1 5 HO(CH2CH20)n- A-R 

Wherein, n is 1-50, A is a bond or -C(0)-, R is C1.50 alkyl or Phenyl C1.50 alkyl. 
One embodiment of the present invention consists of a vaccine formulation 
comprising a polyoxyethylene ether of general formula (I), wherein n is between 1 and 
50, preferably 4-24, most preferably 9; the R component is C1.50, preferably C4-C20 

20 alkyl and most preferably C12 alkyl, and A is a bond. The concentration of the 
polyoxyethylene ethers should be in the range 0.1-20%, preferably from 0.1-10%, and 
most preferably in the range 0,1-1%. Preferred polyoxyethylene ethers are selected 
from the following group: polyoxyethylene-9-lauryl ether, polyoxyethylene-9-steoryl 
ether, polyoxyethylene-8-steoryl ether, polyoxyethylene-4-lauryl ether, 

25 polyoxyethylene-35-lauryl ether, and polyoxyethylene-23-lauryl ether. 
Polyoxyethylene ethers such as polyoxyethylene lauryl ether are described in the Merck 
index (12* edition: entry 7717). These adjuvant molecules are described in WO 
99/52549. 

The polyoxyethylene ether according to the general formula (I) above may, if 
30 desired, be combined with another adjuvant. For example, a preferred adjuvant 
combination is preferably with CpG as described in the pending UK patent application 
GB 9820956.2. 
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According to another embodiment of this invention, an immunogenic 
composition described herein is delivered to a host via antigen presenting cells (APCs), 
such as dendritic cells, macrophages, B cells, monocytes and other cells that may be 
engineered to be efficient APCs. Such cells may, but need not, be genetically modified 
5 to increase the capacity for presenting the antigen, to improve activation and/or 
maintenance of the T cell response, to have anti-tumor effects per se and/or to be 
immunologically compatible with the receiver (i.e., matched HLA haplotype). APCs 
may generally be isolated from any of a variety of biological fluids and organs, 
including tumor and peritumoral tissues, and may be autologous, allogeneic, syngeneic 
10 or xenogeneic cells. 

Certain preferred embodiments of the present invention use dendritic cells or 
progenitors thereof as antigen-presenting cells. Dendritic cells are highly potent APCs 
(Banchereau and Steinman, Nature 592:245-251, 1998) and have been shown to be 
effective as a physiological adjuvEint for eliciting prophylactic or therapeutic antitumor 

15 immunity {see Tinmierman and Levy, Ann, Rev. Med. 50:507-529, 1999). In general, 
dendritic cells may be identified based on their typical shape (stellate in situ, with 
marked cytoplasmic processes (dendrites) visible in vitro) ^ their ability to take up, 
process and present antigens with high efficiency and their ability to activate naive T 
cell responses. Dendritic cells may, of course, be engineered to express specific cell- 

20 surface receptors or ligands that are not commonly found on dendritic cells in vivo or ex 
vivo, and such modified dendritic cells are contemplated by the present invention. As 
an alternative to dendritic cells, secreted vesicles antigen-loaded dendritic cells (called 
exosomes) may be used within a vaccine {see Zitvogel et al.. Nature Med. -#:594-600, 
1998). 

25 Dendritic cells and progenitors may be obtained from peripheral blood, bone 

marrow, tumor-infiltrating cells, peritumoral tissues-infiltrating cells, lymph nodes, 
spleen, skin, umbilical cord blood or any other suitable tissue or fluid. For example, 
dendritic cells may be differentiated ex vivo by adding a combination of cytokines such 
as GM-CSF, IL-4, IL-13 and/or TNFa to cultures of monocytes harvested from 

30 peripheral blood. Alternatively, CD34 positive cells harvested from peripheral blood, 
umbilical cord blood or bone marrow may be differentiated into dendritic cells by 
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adding to the culture medium combinations of GM-CSF, IL-3, TNFa, CD40 ligand, 
LPS, flt3 ligand and/or other compound(s) that induce differentiation, maturation and 
proliferation of dendritic cells. 

Dendritic cells are conveniently categorized as "immature" and "mature" cells, 
5 which allows a simple way to discriminate between two well characterized phenotypes. 
However, this nomenclature should not be construed to exclude all possible 
intermediate stages of differentiation. Immature dendritic cells are characterized as 
APC with a high capacity for antigen uptake and processing, which correlates with the 
high expression of Fey receptor and mannose receptor. The mature phenotype is 
10 typically characterized by a lower expression of these markers, but a high expression of 
cell surface molecules responsible for T cell activation such as class I and class II 
MHC, adhesion molecules (e,g„ CD54 and CD 11) and costimulatory molecules (e.g., 
CD40, CD80, CD86 and 4- IBB). 

APCs may generally be transfected with a polynucleotide of the invention (or 

15 portion or other variant thereof) such that the encoded polypeptide, or an immunogenic 
portion thereof, is expressed on the cell surface. Such transfection may take place ex 
v/vo, and a pharmaceutical composition comprising such transfected cells may then be 
used for therapeutic purposes, as described herein. Alternatively, a gene delivery 
vehicle that targets a dendritic or other antigen presenting cell may be administered to a 

20 patient, resulting in transfection that occurs in vivo. In vivo and ex vivo transfection of 
dendritic cells, for example, may generally be performed using any methods known in 
the art, such as those described in WO 97/24447, or the gene gun approach described 
by Mahvi et al., Immunology and cell Biology 75:456-460, 1997. Antigen loading of 
dendritic cells may be achieved by incubating dendritic cells or progenitor cells with 

25 the tumor polypeptide, DNA (naked or within a plasmid vector) or RNA; or with 
antigen -expressing recombinant bacterium or viruses (e,g., vaccinia, fowlpox, 
adenovirus or lentivirus vectors). Prior to loading, the polypeptide may be covalently 
conjugated to an immunological partner that provides T cell help (e.g., a carrier 
molecule). Alternatively, a dendritic cell may be pulsed with a non-conjugated 

30 immunological partner, separately or in the presence of the polypeptide. 
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While any suitable carrier known to those of ordinary skill in the art may be 
employed in the pharmaceutical compositions of this invention, the type of carrier will 
tj^ically vary depending on the mode of administration. Compositions of the present 
invention may be formulated for any appropriate manner of administration, including 
5 for example, topical, oral, nasal, mucosal, intravenous, intracranial, intraperitoneal, 
subcutaneous and intramuscular administration. 

Carriers for use within such pharmaceutical compositions are biocompatible, 
and may also be biodegradable. In certain embodiments, the formulation preferably 
provides a relatively constant level of active component release. In other embodiments, 

10 however, a more rapid rate of release inmiediately upon administration may be desired. 
The formulation of such, compositions is well within the level of ordinary skill in the art 
using known techniques. Illustrative carriers useful in this regard include 
microparticles of poly(lactide-co-glycolide), polyacrylate, latex, starch, cellulose, 
dextran and the like. Other illustrative delayed-release carriers include supramolecular 

15 biovectors, which comprise a non-liquid hydrophilic core (e.g., a cross-linked 
polysaccharide or oligosaccharide) and, optionally, an external layer comprising an 
amphiphilic compound, such as a phospholipid (see e.g., U.S. Patent No. 5,151,254 and 
PCX applications WO 94/20078, WO/94/23701 and WO 96/06638). The amount of 
active compound contained within a sustained release formulation depends upon the 

20 site of implantation, the rate and expected duration of release and the nature of the 
condition to be treated or.prevented. 

In another illustrative embodiment, biodegradable microspheres (e.g., 
polylactate polyglycolate) are employed as carriers for the compositions of this 
invention. Suitable biodegradable microspheres are disclosed, for example, in U.S. 

25 Patent Nos. 4,897,268; 5,075,109; 5,928,647; 5,811,128; 5,820,883; 5,853,763; 
5,814,344, 5,407,609 and 5,942,252. Modified hepatitis B core protein carrier systems, 
such as described in WO/99 40934, and references cited therein, will also be useful for 
many applications. Another illustrative carrier/delivery system employs a carrier 
comprising particulate-protein complexes, such as those described in U.S. Patent No. 

30 5,928,647, which are capable of inducing a class I-restricted cytotoxic T lymphocyte 
responses in a host. 
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The pharmaceutical compositions of the invention will often further comprise 
one or more buffers (e.g., neutral buffered saline or phosphate buffered saline), 
carbohydrates (e.g., glucose, mannose, sucrose or dextrans), mannitol, proteins, 
polypeptides or amino acids such as glycine, antioxidants, bacteriostats, chelating 
5 agents such as EDTA or glutathione, adjuvants (e.g., aluminum hydroxide), solutes that 
render the formulation isotonic, hjqpotonic or weakly hypertonic with the blood of a 
recipient, suspending agents, thickening agents and/or preservatives. Alternatively, 
compositions of the present invention may be formulated as a lyophilizate. 

The pharmaceutical compositions described herein may be presented in unit- 
10 dose or multi-dose containers, such as sealed ampoules or vials. Such containers are 
typically sealed in such a way to preserve the sterility and stability of the formulation 
until use. In general, formulations may be stored as suspensions, solutions or 
emulsions in oily or aqueous vehicles. Alternatively, a pharmaceutical composition 
may be stored in a freeze-dried condition requiring only the addition of a sterile liquid 
15 carrier immediately prior to use. 

The development of suitable dosing and treatment regimens for using the 
particular compositions described herein in a variety of treatment regimens, including 
e.g., oral, parenteral, intravenous, intranasal, and intramuscular administration and 
formulation, is well known in the art, some of which are briefly discussed below for 
20 general purposes of illustration. 

In certain applications, the pharmaceutical compositions disclosed herein may 
be delivered via oral administration to an animal. As such, these compositions may be 
formulated with an inert diluent or with an assimilable edible carrier, or they may be 
enclosed in hard- or soft-shell gelatin capsule, or they may be compressed into tablets, 
25 or they may be incorporated directly with the food of the diet. 

The active compounds may even be incorporated with excipients and used in 
the form of ingestible tablets, buccal tables, troches, capsules, elixirs, suspensions, 
syrups, wafers, and the like (see, for example, Mathiowitz et al.. Nature 1997 Mar 
27;386(6623):410-4; Hwang et al., Crit Rev Ther Drug Carrier Syst 1998;15(3):243-84; 
30 U. S. Patent 5,641,515; U. S. Patent 5,580,579 and U. S. Patent 5,792,451). Tablets, 
troches, pills, capsules and the like may also contain any of a variety of additional 
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components, for example, a binder, such as gum tragacanth, acacia, cornstarch, or 
gelatin; excipients, such as dicalcium phosphate; a disintegrating agent, such as com 
starch, potato starch, alginic acid and the like; a lubricant, such as magnesium stearate; 
and a sweetening agent, such as sucrose, lactose or saccharin may be added or a 
5 flavoring agent, such as peppermint, oil of wintergreen, or cherry flavoring. When the 
dosage unit form is a capsule, it may contain, in addition to materials of the above type, 
a liquid carrier. Various other materials may be present as coatings or to otherwise 
modify the physical form of the dosage unit. For instance, tablets, pills, or capsules 
may be coated with shellac, sugar, or both. Of course, any material used in preparing 
10 any dosage unit form should be pharmaceutically pure and substantially non-toxic in 
the amounts employed. In addition, the active compounds may be incorporated into 
sustained-release preparation and formulations. 

Typically, these formulations will contain at least about 0.1% of the active 
compound or more, although the percentage of the active ingredient(s) may, of course, 

15 be varied and may conveniently be between about 1 or 2% and about 60% or 70% or 
more of the weight or volume of the total formulation. Naturally, the amount of active 
compound(s) in each therapeutically useful composition may be prepared is such a way 
that a suitable dosage will be obtained in any given unit dose of the compound. Factors 
such as solubility, bioavailability, biological half-life, route of administration, product 

20 shelf life, as well as other pharmacological considerations will be contemplated by one 
skilled in the art of preparing such pharmaceutical formulations, and as such, a variety 
of dosages and treatment regimens may be desirable. 

For oral administration the compositions of the present invention may 
alternatively be incorporated with one or more excipients in the form of a mouthwash, 

25 dentifrice, buccal tablet, oral spray, or sublingual orally-administered formulation. 
Altematively, the active ingredient may be incorporated into an oral solution such as 
one containing sodium borate, glycerin and potassium bicarbonate, or dispersed in a 
dentifrice, or added in a therapeutically-effective amount to a composition that may 
include water, binders, abrasives, flavoring agents, foaming agents, and humectants. 

30 Altematively the compositions may be fashioned into a tablet or solution form that may 
be placed under the tongue or otherwise dissolved in the mouth. 
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In certain circumstances it will be desirable to deliver the pharmaceutical 
compositions disclosed herein parenterally, intravenously, intramuscularly, or even 
intraperitoneally or intradermally. Such approaches are well known to the skilled 
artisan, some of which are further described, for example, in U. S. Patent 5,543,158; U. 
5 S. Patent 5,641,515 and U. S. Patent 5,399,363. In certain embodiments, solutions of 
the active compounds as free base or pharmacologically acceptable salts may be 
prepared in water suitably mixed with a surfactant, such as hydroxypropylcellulose. 
Dispersions may also be prepared in glycerol, liquid polyethylene glycols, and mixtures 
thereof and in oils. Under ordinary conditions of storage and use, these preparations 
10 generally will contain a preservative to prevent the growth of microorganisms. 

Dlustrative pharmaceutical forms suitable for injectable use include sterile 
aqueous solutions or dispersions and sterile powders for the extemporaneous 
preparation of sterile injectable solutions or dispersions (for example, see U. S. Patent 
5,466,468). In all cases the form must be sterile and must be fluid to the extent that 

15 easy syringability exists. It must be stable under the conditions of manufacture and 
storage and must be preserved against the contantiinating action of microorganisms, 
such as bacteria and fungi. The carrier can be a solvent or dispersion medium 
containing, for example, water, ethanol, polyol (e.g., glycerol, propylene glycol, and 
liquid polyethylene glycol, and the like), suitable mixtures thereof, and/or vegetable 

20 oils. Proper fluidity may be maintained, for example, by the use of a coating, such as 
lecithin, by the maintenance of the required particle size in the case of dispersion and/or 
by the use of surfactants. The prevention of the action of microorganisms can be 
facilitated by various antibacterial and antifungal agents, for example, parabens, 
chlorobutanol, phenol, sorbic acid, thimerosal, and the like. In many cases, it will be 

25 preferable to include isotonic agents, for example, sugars or sodium chloride. 
Prolonged absorption of the injectable compositions can be brought about by the use in 
the compositions of agents delaying absorption, for example, aluminum monostearate 
and gelatin. 

In one embodiment, for parenteral administration in an aqueous solution, the 

30 solution should be suitably buffered if necessary and the liquid diluent first rendered 

isotonic with sufficient saline or glucose. These particular aqueous solutions are 

especially suitable for intravenous, intramuscular, subcutaneous and intraperitoneal 
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administration. In this connection, a sterile aqueous medium that can be employed will 
be known to those of skill in the art in light of the present disclosure. For example, one 
dosage may be dissolved in 1 ml of isotonic NaCl solution and either added to 1000 ml 
of hypodermoclysis fluid or injected at the proposed site of infusion, (see for example, 
5 "Remington's Pharmaceutical Sciences" 15th Edition, pages 1035-1038 and 1570- 
1580). Some variation in dosage will necessarily occur depending on the condition of 
the subject being treated. Moreover, for human administration, preparations will of 
course preferably meet sterility, pyrogenicity, and the general safety and purity 
standards as required by FDA Office of Biologies standards. 

10 In another embodiment of the invention, the compositions disclosed herein may 

be formulated in a neutral or salt form. Illustrative pharmaceutically-acceptable salts 
include the acid addition salts (formed with the free amino groups of the protein) and 
which are formed with inorganic acids such as, for example, hydrochloric or 
phosphoric acids, or such organic acids as acetic, oxalic, tartaric, mandelic, and the 

15 like. Salts formed with the free carboxyl groups can also be derived from inorganic 
bases such as, for example, sodium, potassium, ammonium, calcium, or ferric 
hydroxides, and such organic bases as isopropylamine, trimethylamine, histidine, 
procaine and the like. Upon formulation, solutions will be administered in a manner 
compatible with the dosage formulation and in such amount as is therapeutically 

20 effective. 

The carriers can further comprise any and all solvents, dispersion media, 
vehicles, coatings, diluents, antibacterial and antifungal agents, isotonic and absorption 
delaying agents, buffers, carrier solutions, suspensions, colloids, and the like. The use 
of such media and agents for pharmaceutical active substances is well known in the art. 
25 Except insofar as any conventional media or agent is incompatible with the active 
ingredient, its use in the therapeutic compositions is contemplated. Supplementary 
active ingredients can also be incorporated into the compositions. The phrase 
"pharmaceutically-acceptable" refers to molecular entities and compositions that do not 
produce an allergic or similar untoward reaction when administered to a human. 

30 In certain embodiments, the pharmaceutical compositions may be delivered by 

intranasal sprays, inhalation, and/or other aerosol delivery vehicles. Methods for 
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delivering genes, nucleic acids, and peptide compositions directly to the lungs via nasal 
aerosol sprays has been described, e.g., in U. S. Patent 5,756,353 and U. S. Patent 
5,804,212. Likewise, the delivery of drugs using intranasal microparticle resins 
(Takenaga et aL, J Controlled Release 1998 Mar 2;52(l-2):81-7) and lysophosphatidyl- 
5 glycerol compounds (U. S. Patent 5,725,871) are also well-known in the 
pharmaceutical arts. Likewise, illustrative transmucosal drug delivery in the form of a 
polytetrafluoroetheylene support matrix is described in U. S. Patent 5,780,045. 

In certain embodiments, liposomes, nanocapsules, micropeuticles, lipid 
particles, vesicles, and the like, are used for the introduction of the compositions of the 
10 present invention into suitable host cells/organisms. In particular, the compositions of 
the present invention may be formulated for delivery either encapsulated in a lipid 
particle, a liposome, a vesicle, a nanosphere, or a nanoparticle or the like. 
Alternatively, compositions of the present invention can be bound, either covalently or 
non-covalently, to the surface of such carrier vehicles. 

15 The formation and use of liposome and liposome-like preparations as potential 

drug carriers is generally known to those of skill in the art (see for example, Lasic, 
Trends Biotechnol 1998 Jul;16(7):307-21; Takakura, Nippon Rinsho 1998 
Mar;56(3):691-5; Chandran et aL, Indian J Exp Biol. 1997 Aug;35(8):801-9; Margalit, 
Crit Rev Ther Drug Carrier Syst. 1995;12(2-3):233-61; U.S. Patent 5,567,434; U.S. 

20 Patent 5,552,157; U.S. Patent 5,565,213; U.S. Patent 5,738,868 and U.S. Patent 
5,795,587, each specifically incorporated herein by reference in its entirety). 

Liposomes have been used successfully with a number of cell types that are 
normally difficult to transfect by other procedures, including T cell suspensions, 
primary hepatocyte cultures and PC 12 cells (Renneisen et aL, J Biol Chem. 1990 Sep 

25 25;265(27): 16337-42; MuUer et al, DNA Cell Biol. 1990 Apr;9(3):221-9). In addition, 
liposomes are free of the DNA length constraints that are typical of viral-based delivery 
systems. Liposomes have been used effectively to introduce genes, various drugs, 
radiotherapeutic agents, enzymes, viruses, transcription factors, allosteric effectors and 
the like, into a variety of cultured cell lines and animals. Furthermore, he use of 

30 liposomes does not appear to be associated with autoimmune responses or unacceptable 
toxicity after systemic delivery. 
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In certain embodiments, liposomes are fomied from phospholipids that are 
dispersed in an aqueous medium and spontaneously form multilamellar concentric 
bilayer vesicles (also termed multilamellar vesicles (MLVs). 

Alternatively, in other embodiments, the invention provides for 
5 pharmaceutically-acceptable nanocapsule formulations of the compositions of the 
present invention. Nanocapsules can generally entrap compounds in a stable and 
reproducible way (see, for example, Quintanar-Guerrero et aL, Drug Dev Ind Pharm. 
1998 Dec;24(12): 11 13-28). To avoid side effects due to intracellular polymeric 
overloading, such ultrafine particles (sized around 0.1 |im) may be designed using 
10 polymers able to be degraded in vivo. Such particles can be made as described, for 
example, by Couvreur etal, Crit Rev Ther Drug Carrier Syst. 1988;5(l):l-20; zur 
Muhlen et a/., Eur J Pharm Biopharm. 1998 Mar;45(2): 149-55; Zambaux et al. J 
Controlled Release. 1998 Jan 2;50(l-3):31-40; and U. S. Patent 5,145,684. 

15 Cancer Therapeutic Methods 

In further aspects of the present invention, the pharmaceutical compositions 
described herein may be used for the treatment of cancer, particularly for the 
immunotherapy of colon or colorectal cancer. In other embodiments the compositions 
can be used to treat breast, or non-small cell lung carcinoma. Typically the 

20 composition will be useful for treating patients whose cancers express cripto antigen 
and/or whose metastases express cripto antigen. Within such methods, the 
pharmaceutical compositions described herein are administered to a patient, typically a 
warm-blooded animal, preferably a human. A patient may or may not be afflicted with 
cancer. Accordingly, the above pharmaceutical compositions may be used to prevent 

25 the development of a cancer or to treat a patient afflicted with a cancer. Pharmaceutical 
compositions and vaccines may be administered either prior to or following surgical 
removal of primary tumors and/or treatment such as administration of radiotherapy or 
conventional chemotherapeutic drugs. As discussed above, administration of the 
pharmaceutical compositions may be by any suitable method, including administration 

30 by intravenous, intraperitoneal, intramuscular, subcutaneous, intranasal, intradermal, 
anal, vaginal, topical and oral routes. 

-63- 



VU60111 

Within certain embodiments, immunotherapy may be active inmiunotherapy, in 
which treatment relies on the in vivo stimulation of the endogenous host immune 
system to react against tumors with the administration of immune response-modifying 
agents (such as polypeptides and polynucleotides as provided herein). 

5 Within other embodiments, immunotherapy may be passive immunotherapy, in 

which treatment involves the delivery of agents with established tumor-immune 
reactivity (such as effector cells) that can directly or indirectly mediate antitumor 
effects and does not necessarily depend on an intact host immune system. Examples of 
effector cells include T cells as discussed above, T lymphocytes (such as CD8+ 

10 cytotoxic T lymphocytes and CD4+ T-helper tumor-infiltrating lymphocytes), killer 
cells (such as Natural Killer cells and lymphokine-activated killer cells), B cells and 
antigen-presenting cells (such as dendritic cells and macrophages) expressing a 
polypeptide provided herein. T cell receptors and antibody receptors specific for the 
polypeptides recited herein may be cloned, expressed and transferred into other vectors 

15 or effector cells for adoptive inmiunotherapy. 

Effector cells may generally be obtained in sufficient quantities for adoptive 
inmiunotherapy by growth in vitro, as described herein. Culture conditions for 
expanding single antigen-specific effector cells to several billion in number with 
retention of antigen recognition in vivo are well known in the art. Such in vitro culture 

20 conditions typically use intermittent stimulation with antigen, often in the presence of 
cytokines (such as IL-2) and non-dividing feeder cells. As noted above, 
immunoreactive polypeptides as provided herein may be used to rapidly expand 
antigen-specific T cell cultures in order to generate a sufficient number of cellsNfor 
immunotherapy. In particular, antigen-presenting cells, such as dendritic, macrophage, 

25 monocyte, fibrobleist and/or B cells, may be pulsed with immunoreactive polypeptides 
or transfected with one or more polynucleotides using standard techniques well known 
in the art. For example, antigen-presenting cells can be transfected with a 
polynucleotide having a promoter appropriate for increasing expression in a 
recombinant virus or other expression system. Cultured effector cells for use in therapy 

30 must be able to grow and distribute widely, and to survive long term in vivo. Studies 
have shown that cultured effector cells can be induced to grow in vivo and to survive 
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long term in substantial numbers by repeated stimulation with antigen supplemented 
with IL-2 (see, for example, Cheever et al.. Immunological Reviews 157:177, 1997). 

Alternatively, a vector expressing a polypeptide recited herein may be 
introduced into antigen presenting cells taken from a patient and clonally propagated ex 
5 vivo for transplant back into the same patient. Transfected cells may be reintroduced 
into the patient using any means known in the art, preferably in sterile form by 
intravenous, intracavitary, intraperitoneal or intratumor administration. 

Routes and frequency of administration of the therapeutic compositions 
described herein, as well as dosage, will vary from individual to individual, and may be 

10 readily established using standard techniques. In general, the pharmaceutical 
compositions and vaccines may be administered by injection (e.g., intracutaneous, 
intramuscular, intravenous, intradermally or subcutaneous), intranasally (e.g., by 
aspiration) or orally. Preferably, between 1 and 10 doses may be administered over a 
52 week period. Preferably, 6 doses are administered, at intervals of 1 month, and 

15 booster vaccinations may be given periodically thereafter. Alternate protocols may be 
appropriate for individual patients. A suitable dose is an amount of a compound that, 
when administered as described above, is capable of promoting an anti-tumor inmiune 
response, and is at least 10-50% above the basal (i.e., untreated) level. Such response 
can be monitored by measuring the anti-tumor antibodies in a patient or by vaccine- 

20 dependent generation of cytolytic effector cells capable of killing the patient's tumor 
cells in vitro. Such vaccines should also be capable of causing an immune response 
that leads to an improved clinical outcome (e.g., more frequent remissions, complete or 
partial or longer disease-free survival) in vaccinated patients as compared to non- 
vaccinated patients. In general, for pharmaceutical compositions and vaccines 

25 comprising one or more polypeptides, the amount of each polypeptide present in a dose 
ranges from about 25 |Xg to 5 mg per kg of host. Suitable dose sizes will vary with the 
size of the patient, but will typically range from about 0. 1 ml to about 5 ml. 

In general, an appropriate dosage and treatment regimen provides the active 
compound(s) in an amount sufficient to provide therapeutic and/or prophylactic benefit. 
30 Such a response can be monitored by establishing an improved clinical outcome (e.g., 
more frequent remissions, complete or partial, or longer disease-free survival) in treated 
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patients as compared to non-treated patients. Increases in preexisting immune 
responses to a tumor protein generally correlate with an improved clinical outcome. 
Such immune responses may generally be evaluated using standard proliferation, 
cytotoxicity or cytokine assays, which may be performed using samples obtained from 
5 a patient before and after treatment. 

One embodiment of the invention provides an immunogentic fragment of a 
Cripto polypeptide wherein the immunogenic fragment is immunologically reactive 
with an antibody and/or T-cell that reacts with a polypeptide of SEQ ID NO:3 or SEQ 
ID NO:4, wherein said inmiunogenic fragment is at least 20 contiguous amino acids in 
10 length, comprises SEQ ID NO:97 and does not contain SEQ ID NO: 11 and SEQ ID 
NO: 12. In another aspect of the invention, the fragment is part of a larger fusion 
protein. In another aspect of the invention, the fragment is chemically conjugated to a 
carrier protein. 

In another aspect of the invention, an isolated polynucleotide encoding the 
15 immunogenic fragment is provided. In yet another aspect, an expression vector 
comprising the isolated polynucleotide operably linked to an expression control 
sequence is provided. In another aspect, the invention provides a recombinant viral or 
bacterial delivery system comprising the isolated polynucleotide. In another aspect of 
the invention, a host cell comprising the isolated polynucleotide is provided. 

20 In yet another aspect of the invention, an inmiunogenic composition comprising a 

first component comprising a physiologicedly acceptable carrier, immunostimulant, and 
adjuvant, and a second component comprising an immunogenic fragment comprising 
SEQ ID NO:97 is provided. In one aspect, the immunostimulant is a TH-1 inducing 
adjuvant. In another aspect, the TH-1 inducing adjuvant comprises 3D-MPL, QS21, a 

25 mixture of QS21 and cholesterol, and a CpG oligonucleotide. In another embodiment 
of the invention, a method for treating cancer in a patient is provided, comprising 
administering to the patient an immunogenic composition comprising SEQ ID NO:97. 

In another embodiment, a method for stimulating T cells specific for Cripto is 
provided, comprising contacting said cells with an inununogenic fragment of a Cripto 
30 polypeptide wherein the immunogenic fragment is immunologically reactive with an 
antibody and/or T-cell that reacts with a full-length polypeptide of SEQ ID NO:3 or 
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SEQ ID NO:4, wherein said immunogenic fragment is at least 20 contiguous amino 
acids in length, comprises SEQ ID NO:97 and does not contain SEQ ID NO: 1 1 or SEQ 
ID NO: 12. In another embodiment an isolated T cell population is provided. 

In another embodiment, the present invention provides, a method for inhibiting the 
5 development of a cancer in a patient, comprising the steps of: (a) incubating CD4+ 
and/or CD8+ T cells isolated from a patient with SEQ ID NO:97; and (b) administering 
to the patient an effective amount of the T cells, and thereby inhibiting the development 
of a cancer in the patient. In another embodiment the T cells are allowed to proliferate. 

In yet another embodiment, a method for producing an immunogenic response to 
10 Cripto in an animal is provided comprising administering a first component comprising 
a polynucleotide encoding SEQ ID NO:97 and does not encode SEQ ID NO:3 or SEQ 
ID NO:4 to the animal. In one aspect, the polynucleotide is recombinant DNA. In 
another aspect, the first component is admixed with a second component comprising an 
immunostimulant, adjuvant and physiologically acceptable carrier. In yet another 
15 aspect, the immunogenc composition is repeatedly administered. In yet another aspect, 
first component is admixed with a second component comprising a physiologically 
acceptable carrier, adjuvant and immunostimulant prior to administering the first 
component to the animal. It will be understood by the skilled artisan that as a result of 
the degeneracy of the genetic code, there are many nucleotide sequences that encode a 
20 polypeptide as described herein. 

As used herein "cripto" refers to polypeptides having about 188 amino acids and 
being expressed in carcinoma cells. In addition, "cripto" refers to polypeptides having 
at least 70%, but preferably 80%, and more preferably 90%, and more preferably 95%, 
and yet most preferably 98% sequence identity to SEQ ID NOs: 3 or 4 or variants 
25 thereof, including Cripto 1 and Cripto 3. 
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Examples 

The following examples illustrate various aspects of this invention. These 
examples do not limit the scope of this invention which is defined by the appended 
claims. 

EXAMPLE la 

EXPRESSION OF IMMUNOREACTIVE CRIPTO-1 IN HUMAN LESIONS 



Tissue 



Colon 

Stomach 
Pancreas 

Gall Bladder 

Breast 
Non Small Cell Lung 



Non-involved 
Epithelium 

25/193(13) 

1/37 (3) 
10/58(17) 

N.D. 

5/33 (15) 



Premalignant Lesion 



TA 

26/65(40) 
IM 

16/30 (53) 

Hyperplasias 

6/9(67) 



TVA 

10/13(77) 



Adenomas 

4/7(58) 
DCIS 
26/55 (47) 



Carcinoma 

122/168(73)** 
17/37 (46)** 

58/98 (59)** 

89/132 (68)** 

497/631 (79)** 
178/195 (91)** 



Tissue 



Ovary 



Endometrium 



Cervix 
Testis 



Non-involved 
Epithelium 

6/7 (86) post- 
menopausal 



Cystadenomas 

Serous 
6/14(43) 



Mucinous 

0/7 



3/9 (33) 
10/28 (36) post-menopausal 



4/25 (17) 
0/3 



3/8 (38) 



Serous 



4/10(40) 



4/8 (50) 
Borderline: 10/10 



Mucinous 



4/5 (80) 

23/40 (58) 
25/48 (52) 
9/10(90) 



Hyperplasias 

18/30 (60) 



Embryonal Carcinomas 

19/19(100)** 

Adrenal Cortex 
Bladder 0/6 
Renal 
Prostate 

TA=Tubular adenoma 
TVC=Tubulo villous adenoma 

IM=:Intestinal metaplasia 

** Statistically significant expresson in carcinomas over non-involved tissues 



53/91 (58)* 

68/96 (71)* 
40/74 (54)* 
29/51 (57)** 
Seminomas 
10/32 (31) 
1/3 (33) 
23/39 (60)** 
0/18 
0/9 



10 Example 1.6 : Over-expression of Cripto in cancerous tissues. 
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Real-time RT-PCR (U. Gibson. 1996. Genome Research: 6,996) is used to compare 
mRNA transcript abundance of the target protein in a panel of normal and tumor tissues 
and/or cell lines. This analysis is critical to establish the tumor specificity of Cripto 
5 expression, which is an important criterion a good vaccine candidate must fulfil. 

Total RNA is extracted from snap frozen biopsies or cell lines using TriPure reagent 
(Roche), Total RNA from normal tissues is also purchased from InVitrogen. Poly-A+ 
mRNA is purified from total RNA after DNAase treatment using oligo-dT magnetic beads 
10 (Dynal). Quantification of the mRNA is performed by spectrofluorimetry (VersaFluor, 
BioRad) using RiboGreen dye (Molecular Probes). 

TaqMan primers (forward primer sequence : TGGGTAGGAAAGAGGAAGCAAAT, SEQ 
ID NO:7; reverse primer sequence : TGCTTCTCTACCACCACCTAATCA, SEQ ID 
15 NO:8) and probe for real-time RT-PCR amplification are designed with the Perkin- 
Elmer Primer Express software using default options for TaqMan amplification 
conditions. 

Real-time reactions are assembled according to standard PGR protocols using 2 ng of 
20 reverse transcribed mRNA (Expand RT, Roche) for each reaction. Either SybrI or 
TaqMan detection is undergone, depending on the evaluated sample. In case of SybrI 
detection, SybrI dye (Molecular Probes) is added at a final dilution of 1/75000 for real- 
time detection, and TaqMan probe is omitted. Amplification (40 cycles) and real-time 
detection is performed in a Perkin-Elmer Biosystems PE7700 system using conventional 
25 instrument settings. Ct values are calculated using the PE7700 Sequence Detector 
Software. Ct values are obtained from each tissue sample for the target mRNA (CtX) and 
for the actin mRNA (CtA). 

As the efficiency of PGR amplification under the prevailing experimental conditions is 
30 close to the theoretical amplification efficiency, 2^^^^"^^^ value is an estimate of the 
relative target transcript level of the sample, standardized with respect to Actin transcript 
level. A value of 1 thus suggests the candidate antigen and Actin have the same expression 
level. 

35 RT-PCR analysis, using SybrI detection, was performed on a set of colon tumor and 

matched normal colon from 6 different patients and 48 normal tissue samples. A TaqMan 

detection was run on a set of colon tumor and matched normal colon from 6 other patients 

(reactions were run in triplicates) and 48 normal tissue samples. 

Tested normal tissues (and the abbreviations used in graphics) are shown below : 
40 ^ Adrenal gland (Ad^Gl) 

^ Aorta (Ao) 

^ Bladder (Bl) 

^ Bonne marrow Bo_Ma 

^ Brain (Bra, Bral , Bra2, Bra3, Bra4, Bra5) 
45 ^ Cervix (Ce) 
Colon (Co) 
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Fallopian tube (Fa_Tu) 
^ Heart (He) 
^ neum(n) 
^ Kidney (Ki) 
5 Liver(Li, Lil,Li2) 

Lung (Lu) 
^ Lymph node (Ly_No) 

Esophagus (Oe) 
^ Ovary (Ov) 
10 ^ Pancreas (Pa, Panel, Panc2) 

Parathyrois (Pa_Thy) 
^ Placenta (PI) 
^ Prostate (Pr) 
^ Rectum (Re) 
15 >^ Skin(Sk) 

^ Skeletal muscle (Sk_Mu) 

Small intestine (Sm_In) 
^ Spleen (Sp) 
^ Stomach (St) 
20 ^ Testis (Te) 

^ Thyroid (Thyr, Thy, Thy l,Thy2) 
^ Thymus (Thyml,)Thym2 
^ Trachea (Tr, Tra) 

25 Real-time RT-PCR reactions, using SybrI detection, were also performed on a set of 7 
lung cell lines: 

. CRL-5803 (Carcinoma, Non-Small Cell Lung Cancer, large cell, neuroendocrine, 
metastatic site : lymph node) 

CRL-5807 (Bronchioalveolar carcinoma, Non-Small Cell Lung Cancer) 
30 ^ CRL-58 10 (Adenocarcinoma, Non-Small Cell Lung Cancer) 
CRL-58 15 (Carcinoid, lung bronchus) 
^ CRL-5865 (Adenocarcinoma, metastatic site : pleural effusion) 

CRL-9609 (Normal lung, bronchus, epithelial, virus transformed) 
^ HTB-177 (Carcinoma, large cell lung cancer, pleural effusion) 
35 and 2 colon cell lines: 

^ CRL-2159 (Carcinoma, Cecum, Dukes' B) 
^ CCL-250 

Fresh biopsy normal lung tissues (Lu(ucl), Lu(IVG)) and a lung tumor tissue (LuTum) 
were also performed as control. 



40 



RT-PCR results on colorectal biopsies and normal tissues are shown in Figures 1, 2, 3 and 
4, and in Table 1. RT-PCR results on cell lines are shown in Figure 5. 
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Sybr detection' 


TaqMan detection' 


Colorectal tumor versus adjacent normal colon^ 


Number of over-expressing 
patients 


5/6 


5/6 


Average over-expression fold 
in over-expressing patients 


200 


90 


Median over-expression fold 
(minimum-maximum) 


64 (22-724) 


20 (4-397) 




Colorectal tumor versus average normal tissues^ 


Number of over-expressing 
patients 


5/6 


4/6 


Average over-expression fold 
in over-expressing patients 


10 


12 


Median over-expression fold 
(minimum-maximum) 


1 1 (3-22) 


7 (3-32) 


Normal tissues with a high 
transcript level (normal-to- 
tumor ratio) 


Spleen (0.5) 


Spleen (0.75), ovary (2)"* 



Table 1 : Cripto expression in colorectal tumors and normal tissues. 

1. Transcript levels were calculated in colorectal tumors and a panel of normal tissues 
5 using 2 detection techniques : TaqMan and Sybr. Regarding Cripto, TaqMan 

detection involved 6 patients and measures were done in triplicates, whereas Sybr 
detection was undergone on 6 different patients. 

2. Transcript level in colorectal tumors was compared to both matched normal colon 
and average of normal tissue transcript levels. 

10 3. A normal tissue has a high transcript level when it is higher than one fifth of 
colorectal tumors transcript level. 
4. Ovary has not been evaluated in Cripto Sybr experiment. 

Table 1, Figures 1, 2, 3 & 4 clearly show that Cripto, while being marginally expressed in 
15 normal adult tissues, is highly over-expressed in a majority of colorectal tumors, with an 
over-expression rate of more than ten fold. Moreover, Figure 5 indicates Cripto is 
dramatically over-expressed in a lung tumor cell line (CRL-5815). Cripto tumor 
associated antigen is therefore a suitable vaccine candidate to treat both colorectal and 
lung cancer patients. 

20 

Example 2 : Cloning of Cripto-l c-DNA from lung tumor cell lines. 
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Total RNA was extracted using TriPure reagent from 10^ cultured cells of 7 different 
lung cell lines (see section 1 for the complete list of cell lines). Total RNAs were 
pooled, and mRNA was purified from pooled total RNA on oligo-d(T) magnetic beads 
(Dynal). 250 ng of mRNA were used for cDNA synthesis. Quantification of the mRNA 
5 is performed by spectrofluorimetry (VersaFluor, BioRad) using RiboGreen dye 
(Molecular Probes). cDNA was synthesized using the GeneRacer technology 
(Invitrogen) which ensures the amplification of only full-length transcripts. mRNA was 
treated with GIF. mRNA 5' ends were decapped with TAP (Tobacco Acid 
Pyrophosphatase) and were ligated to a specific RNA oligonucleotide. The ligated 

10 mRNA was reverse transcribed into cDNA using an oligod(T)-tailed primer. 
Amplification of cDNA was performed using both GeneRacer flanking primers 
(Advantage, Clontech). Cripto amplification was performed on 10 ng of GeneRacer 
cDNA using gene specific PGR primers (forward primer sequence : 
CGTCCAAGGCCGAAAGCCCTCCAGTT, SEQ ID NO:9; and reverse primer sequence : 

15 TTGGGAGAGGGCAGGGCAAAGAAGTAAGAA, SEQ ID NO: 10). PGR reaction was done 
with Advantage II Taq DNA polymerase (Glontech) under standard conditions. PGR 
product was cloned in pGR4-TOPO plasmid (Invitrogen). Amplified sequence (SEQ ID 
NO:95) was shown to display a variation at codon 22 (SEQ ID NO:96): Ala (GGG) 
instead of Val (GTG) in the native version (SeqID6). Native version was restored by 

20 PGR mutagenesis. 

Example 3 ; Immunogenicitv of Cripto tumor-associated antigen in animal models 

The immunogenicity of the antigen of the present invention can be verified by 
25 inmiunizing rabbits and mice using various means of immunization. Indeed, 
inmiunization with Gripto forms, either peptide or recombinant protein could induce 
humoral immune response with the generation of specific antibodies against Gripto 
and/or could induce a Gripto specific cellular immune response. In vivo delivery of 
Gripto protein using for instance, naked DNA in an appropriate vector encoding Gripto 
30 or fragments of Gripto, Gripto genes delivered by a viral vector encoding Gripto or 
fragments of Gritpo, could also be useful to demonstrate Gripto immunogenicity. 

3.1 : Synthetic peptide immunization. 

35 The synthetic peptides from human Gripto- 1 amino-acid sequence selected to immunize 
rabbits are GHQEFARPSRGYL (13 amino acids, SEQ ID NO: 11), and 
QEEPAIRPRSSQRVPPMG (18 amino acids, SEQ ID NO: 12). Synthetic peptides are 
then conjugated to a carrier protein (KLH). Gonjugates are formulated with Freund's 
adjuvant, and two rabbits are inmiunized with each of the conjugates. Four weeks after 

40 the second immunization and four weeks after the third immunization, a blood sample 
is taken. Anti-Gripto antibody titers are estimated in the serum by ELISA and/or 
Western Blot following standard protocols (see section 3.5). 

3.2 : Nucleic acids immunization. 

45 
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pcDNA3.1 vector (Invitrogen) is used to construct the vaccinating plasmid. To promote 
secretion of the in vivo translated protein and to therefore induce the humoral response 
against the present invention antigen, nucleic acid sequence encoding Cripto-l with its 
own signal peptide is inserted into the vector polycloning site. The recombinant 
5 expression plasmid is used to transform a host E. coli strain such as BL21. 

The above recombinant strain is grown in conventional cell culture medium. Bacteria 
are harvested before reaching the stationary phase. Plasmid preparation using Quiagen 
system is undergone for injection in mice. 

10 

Six to eight weeks-old Balb/c mice receive intramuscular injections of recombinant 
expression plasmid. Two weeks after the last injection, a blood sample is taken. Titers 
of specific antibodies elicited against the present invention antigen are determined by 
ELISA and/or Western Blot (see section 3.5). 

15 

3.3 Viral vector immunization using Adenoviruses* 

Recombinant adenoviruses are effective vectors for gene-based vaccination because 
they are capable of eliciting humoral and cellular immune responses against the 

20 encoded antigen. The nucleotide sequence coding for Cripto-l protein with its own 
signal sequence could be inserted in an appropriated Adeno derivative viral vector. The 
adenoviral recombinant vector could be administrated to mice by different routes 
(intramuscular, intranasal, intradermal, subcutaneous or intraeperitoneal) After two 
week , blood samples could be taken and the titer of antibodies elicited examined. 

25 Additional experiments to measure the cellular immune response could also be 
performed. 

3.4 : Recombinant protein immunization. 

30 3.4.1 : Expression and purification of Cripto-l recombinant protein 

Expression in microbial hosts, is used to produce the whole protein or fragments of the 
invention antigen for immunization purposes. Recombinant proteins may be expressed 
in two microbial hosts, E. coli and in yeast (such as Saccharomyces cerevisiae or 
35 Pichia pastoris). This allows the selection of the expression system with the best 
features for this particular antigen production. 

The expression strategy first involves the design of the primary structure of the 
recombinant antigen. In general, an expression fusion partner (EFP) to improve levels 
40 of expression and/or an immune fusion partner (IFP) to modulate the immunogenic 
properties of the antigen, are placed at the N-terminal extremity. In addition, an affinity 
fusion partner (AFP) useful for facilitating further purification is included at the C- 
terminal end. 
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When the recombinant strains are available, the recombinant product is characterized 
by the evaluation of the level of expression and the prediction of further solubility of 
the engineered protein by analysis of its behavior in the crude extract. 

5 After growth in appropriate culture medium and induction of the recombinant protein 
expression, total extracts are analyzed by SDS-PAGE. The recombinant proteins are 
visualized in stained gels and identified by Western blot analysis using the specific 
anti-peptide antibodies generated by peptide immunization in rabbit (see section 3.1). 

10 A comparative evaluation of the different versions of the expressed antigen and 
expression hosts will allow the selection of the most promising candidate and host that 
is to be used for further purification and further immunological evaluation. 

The purification schemes used a Histidine affinity tail in the recombinant protein. In a 
15 typical experiment the disrupted cells are filtered and the acellular extracts loaded onto 
an Ion Metal Affinity Chromatography (IMAC; Ni++NTA from Qiagen) that will 
specifically retain the recombinant protein. The retained proteins are eluted by 0-500 
mM Imidazole gradient (possibly in presence of a detergent) in a phosphate buffer. 

20 

3.4.2 : Protein immunization 

Rabbits are immunized, intramuscularly several times at several week intervals with 
recombinant purified protein, formulated in the adjuvant 3D-MPL/QS2L Three weeks 
25 after each immunization, blood samples are taken. Anti-Cripto antibody titer is 
estimated in the serum by ELISA. The specificity of the anti-Cripto antibodies 
generated is tested by Western Blot (see section 3.5) using the purified protein and 
including appropriated controls. 

30 3.5 : Immunological response assays of Cripto-immunized animals. 

Humoral response to Cripto immunization is assessed by measuring Cripto specific 
antibody titers in animal sera using ELISA and Western Blot. The following material 
harboring Cripto- 1 derived peptides or full protein could be used for such test : 
35 ^ Cripto synthetic peptides (see section 3.1 for possible peptides), or 

protein extracts from cultures of Cripto-expressing cell lines (see section 1 for 

possible cell lines), or 

^ lysates of COS cells that have been engineered to transiently express a Cripto 

recombinant plasmid (see below), or 
40 ^ protein extracts of recombinant E. coli or yeast strains (see section 3.3.1 for 

recombinant strain generation), or 
^ purified recombinant antigen (see section 3.3.1 for antigen purification). 
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Transient expression of Cripto in COS cells is obtained by transiently transfecting COS 
cells with recombinant pcDNAS.l plasmid prepared for nucleic acid vaccination (see 
section 3.2). 

5 For ELISA reactions, one of the above mentioned antigen is coated on microtiter plates 
(purified recombinant protein is preferred). ELISA is then performed using standard 
protocol. 

Western Blots are realized under standard conditions with one of the above mentioned 
10 antigen (purified recombinant protein is preferred, synthetic peptides are not used) 

The cellular response can also be assessed by stimulating in vitro vaccinated mouse 
spleen cells with the peptides used to immunize the mice, or with antigen-derived 
overlapping peptides covering the whole antigen sequence. 

15 

Example 4 ; Demonstration of existing Cripto specific human T-cell by in vitro 
priming 

Immunological relevance of Cripto- 1 can be further confirmed by in vitro priming of 
20 human T cells. All T cell lymphocytes, T cell lines and dendritic cells are derived from 
PBMCs (peripheral blood mononuclear cells) of healthy donors or cancer patients 
(preferred donors are from HLA-A2 subtype). 

Epitopes binding of HLA alleles prediction; 

25 The HLA Class I binding peptide sequences (nonamers, decamers) are predicted either 
by the Parker's algorithm (Parker, et al, J, Immunol, 152: 163 (1994)) 
(http://bimas.dcrt.nih.gov/molbio/hla_bind/), and the Rammensee method (Rammensee 
et aL (1997). Landes Bioscience (1997); Rammensee et aL Immunogenetics 41,178-228 
(1995)). (http://syfpeithi.bmi-heidelberg.com/Scripts/MHCServer.dll/EpPredict.htm). 

30 

The HLA Class II binding peptide sequences (nonamers) are predicted using the 
Tepitope algorithm (Stumiolo et al. Nat BiotechnoL 17: 555-61 (1999). 

CD8+ T-cell response: t 
35 Two strategies to raise the CD8+ T cell lines are used: a peptide-based approach and a 
whole gene-based approach. Both approaches require the full-length cDNA of interest 
in the correct reading frame to be cloned in an appropriate delivery system and to be 
used to predict the sequence of the HLA binding peptides. 

40 Peptide-based approach: 

For this approach, an HLA-A2.1/Kb transgenic mouse model is used for screening of 
the HLA-A2.1 peptides. Briefly, transgenic mice are immunized with adjuvanted HLA- 
A2 peptides, those able to induce a CDS response (as defined by an efficient lysis or g- 
WN production on peptide-pulsed target cells) are further analyzed in the human 

45 system. 
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Human dendritic cells (cultured according to Romani et aL J Exp Med. 180:83- 
93(1994)) will be pulsed with the selected peptides and used to stimulate CD8+-sorted 
T cells (by Facs). After several weekly stimulation, the CD8+ lines are first tested on 
peptide-pulsed autologous BLCL (EBV-B transformed cell lines). To verify the proper 
5 in vivo processing of the peptide, the CD8+ lines are then tested on cDNA-transfected 
tumor cells (HLA-A2 transfected LnCaP, Skov3 or CAM A tumor cells). 

Whole gene-based approach : 

CD8-I- T cell lines are primed and stimulated with either gene-gun transfected dendritic 
10 cells, retrovirally transduced B7.1 -transfected fibroblasts, recombinant pox virus (Kim, 
etal J Immunother. 20:21 6'2%6 (1997)) or adenovirus (Butterfield, et aL J ImmunoL 
161:5607-13 (1998)). infected dendritic cells. Virus infected cells are very efficient to 
present antigenic peptides since the antigen is expressed at high level but can only be 
used once to avoid the over-growth of viral T cells lines. 

15 

After alternated stimulation, the CD8+ lines are tested on cDNA-transfected tumor cells 
as indicated above. Peptide specificity and identity is determined to confirm the 
immunological validation of antigen of the present invention. 

20 CD4+ T-cell response: 

Simileirly, the CD4+ T-cell immune response can also be assessed. Generation of 
specific CD4+ T cells is made using dendritic cells loaded with recombinant purified 
protein or peptides to stimulate the T-cells. 

25 Results : 

A- Prediction of Class I epitopes using the Parker method. ' 



A-1 Cripto-1 and -3 Class I epitopes. 



HLA allele 


Epitope size 


Start 
position 


Sequence 


Score 


SEQ ID NO: 


A68 


Decamer 


12 


SVIWIMAISK 


240.000 


SEQIDNO:13 


A68 


Decamer 


117 


SVPHDTWLPK 


120.000 


SEQ ID NO: 14 


B2705 


Nonamer 


33 


HQEFARPSR 


100.000 


SEQ ID NO: 15 


B2705 


Nonamer 


59 


IRPRSSQRV 


600.000 


SEQ ID NO: 16 


B2705 


Nonamer 


72 


IQHSKELNR 


100.000 


SEQ ID NO: 17 


B2705 


Nonamer 


103 


GRNCEHDVR 


1000.000 


SEQ ID NO: 18 


B2705 


Nonamer 


110 


VRKENCGSV 


600.000 


SEQ ID NO: 19 


B2705 


Nonamer 


138 


LRCFPQAFL 


2000.000 


SEQ ID NO:20 


B2705 


Nonamer 


161 


SRTPELPPS 


200.000 


SEQIDNO:21 


B2705 


Decamer 


65 


QRVPPMGIQH 


200.000 


SEQ ID NO:22 


B2705 


Decamer 


79 


NRTCCLNGGT 


200.000 


SEQ ID NO:23 


B2705 


Decamer 


103 


GRNCEHDVRK 


2000.000 


SEQ ID NO:93 


B2705 


Decamer 


136 


GQLRCFPQAF 


100.000 


SEQ ID NO:24 


B2705 


Decamer 


161 


SRTPELPPSA 


200.000 


SEQ ID NO:94 


B5101 


Decamer 


143 


QAFLPGCDGL 


110.000 


SEQ ID NO:25 


B5102 


Decamer 


143 


QAFLPGCDGL 


302.500 


SEQ ID NO:25 
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TJ C 1 AO 

DDiUZ 


Decamer 


1 jU 


DGLVMDEHLV 


1 OA r\nf\ 

1 20.001) 


SbQ LD NO:2o 


SjOVJ 


iNonamer 








Olil^ LL/ iNV-/.Z/ 


B60 


Nonamer 


76 


KELNRTCCL 


320.000 


SEQ ID NO:28 


B62 


Nonamer 


137 


QLRCFPQAF 


240.000 


SEQ ID NO:29 


B62 


Decamer 


136 


GQLRCFPQAF 


160.000 


SEQ ID NO:24 


B7 


Nonamer 


169 


SARTTTFML 


120.000 


SEQ ID NO:30 



A-2 Cripto 



1 specific class I epitopes 



"WWW A ^ILL 

HLA allele 


Epitope size 


Start 
position 


Sequence 


Score 


SEQ ID NO: 


A0201 


Nonamer 


5 


KMARFSYSV 


668.086 


SEQ ID NO: 31 


A0201 


Decamer 


16 


IMAISKVFEL 


349.885 


SEQ ID NO:32 


A0201 


Decamer 


13 


VIWIMAISKV 


310.361 


SEQ ID NO:33 


A0201 


Decamer 


175 


FMLVGICLSI 


128.242 


SEQ ID NO:34 


B2702 


Nonamer 


7 


ARFSYSVIW 


500.000 


SEQ ID NO:35 


B2702 


Nonamer 


3 


CRKMARFSY 


200.000 


SEQ ID NO:36 


B2702 


Decamer 


7 


ARFSYSVIWI 


300.000 


SEQ ID NO:37 


B2702 


Decamer 


37 


ARPSRGYLAF 


200.000 


SEQ ID NO:38 


B2705 


Nonamer 


7 


ARFSYSVIW 


1000.000 


SEQ ID NO: 35 


B2705 


Nonamer 


3 


CRKMARFSY 


1000.000 


SEQ ID NO:36 


B2705 


Nonamer 


170 


ARTTTFMLV 


600.000 


SEQ ID NO:39 


B2705 


Nonamer 


37 


ARPSRGYLA 


200.000 


SEQ ID NO:40 


B2705 


Decamer 


7 


ARFSYSVIWI 


3000.000 


SEQIDNO:37 


B2705 


Decamer 


37 


ARPSRGYLAF 


1000.000 


SEQ ID NO:38 


B2705 


Decamer 


3 


CRKMARFSYS 


200.000 


SEQIDNO:41 


B4403 


Decamer 


34 


QEFARPSRGY 


120.000 


SEQ ID NO:42 


B5101 


Nonamer 


6 


MARFSYSVI 


286.000 


SEQ ID NO:43 


B5101 


Decamer 


169 


SARTTTFMLV 


1 10.000 


SEQ ID NO:44 


B5102 


Nonamer 


17 


MAISKVFEL 


150.000 


SEQ ID NO:45 


B5102 


Nonamer 


6 


MARFSYSVI 


100.000 


SEQ ID NO:43 


B5103 


Nonamer 


6 


MARFSYSVI 


100.000 


SEQ ID NO:43 


B5103 


Decamer 


169 


SARTTTFMLV 


121.000 


SEQ ID NO:44 


B7 


Nonamer 


36 


FARPSRGYL 


180.000 


SEQ ID NO:46 
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A-3 Cripto-3 specific class I epitopes. 



HLA allele 


f^nitnne ^i7e 


Start 
nnsiitinn 


Seauencp 


Score 


SEO ID NO- 


A0201 


Nonamer 


5 


KMVRFSYSV 


668 086 


SEO ID NO-47 

\J1^\^ X^V^,"T / 


A0201 


Nonamer 


89 


CMLESFCAC 


103.417 


SEO ID NO-48 


A0201 


Decamer 


16 


IMAISKAFEL 


152.124 


vSFO ID NO-49 


A0201 


Oecamer 


175 


FMLAGICLSI 


128.242 


SEO ID NO-50 


B2702 


Nonamer 


7 


VRFSYSVTW 


500 000 


SEO ID NO-51 


B2702 


Nonamer 


3 




200 000 


SFO ID NO-S2 


B2702 


T^pcampr 


7 


V xxir o X V ^ vv ^ 


100 000 


SFO ID NO Sl 

w3X-«V^ JX-/ X^V-'...J«J 


B2702 


T^pcampr 


37 




200 000 


SFO ID NO S4 


B2705 


Nonampr 


3 


PRKMVRF^Y 

\«.XXX>J7X V XxX^ w X 


1 000 000 


SFO ID NO S7 


B2705 


Non a mp r 


7 


V x\.r o X o V X vv 


1000 000 


SFO in NO-SI 


B2705 


N^onampr 


'^7 


ARPCIPnnT.A 

xu\ XT 1^ XWJ 1^ J-l^> 


900 000 


SFO in NO-SS 

oXJV^ XL/ l^W.JJ 


B2705 




170 


AP TTTFMT . A 


900 ooo 


SFO TD Mn*Sfi 


B2705 




7 


\7P F c;Y<^\7T WT 


1000 000 


SFO TD NO- SI 


B2705 


Opr* a mpr 


17 


APPCIRfinT.AF 


1 000 000 


SFO TD NO S4 






SO 


TP PP QnP\/T . 


600 000 

\J\J\J ,\J\J\J 


SFO TD Tvjn S7 


B2705 


Decamer 


3 




200 000 

^ V/ V/ > \_/ v_/v_/ 


SEO ID NO'58 


B2705 


Decamer 


65 


QRVIiPMGIQH 


200.000 


SEQ ID NO:59 


B5101 


Nonamer 


60 


RPRSSQRVL 


120.000 


SEQ ID NO:60 


B5102 


Nonamer 


17 


MAISKAFEL 


150.000 


SEQ IDNO:61 


B7 


Nonamer 


60 


RPRSSQRVL 


800.000 


SEQ ID n6:60 


B7 


Nonamer 


36 


FARPSRGDL 


180.000 


SEQ ID NO:62 



NB : Score is an estimate of half-time of disassociation of a molecule containing this 
subsequence. 



5 B- Prediction of Class I epitopes using the Rammensee method. 



B-1 Cripto-l and -3 Class I epitopes. 



HLA allele 


Epitope size 


Start 
position 


Sequence 


Score 


SEQ ID NO: 


A26 


Nonamer 


179 


GICLSIQSY 


27 


SEQ ID NO:63 


A26 


Decamer 


27 


LVAGLGHQEF 


25 


SEQ ID NO:64 


A26 


Decamer 


121 


DTWLPKKCSL 


25 


SEQ ID NO:65 


A3 


Nonamer 


58 


AIRPRSSQR 


29 


SEQ ID NO:66 


A3 


Nonamer 


13 


VIWIMAISK 


24 


SEQ ID NO:67 


A3 


Decamer 


12 


SVIWIMAISK 


29 


SEQ ID NO: 13 


A3 


Decamer 


117 


SVPHDTWLPK 


24 


SEQ ID NO: 14 


B2705 


Nonamer 


103 


GRNCEHDVR 


25 


SEQ ID NO: 18 


B2705 


Nonamer 


138 


LRCFPQAFL 


24 


SEQ ID NO:20 
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B-2 Cripto-1 specific class I epitopes 



HLA allele 


F^nitnne <iize 


Start 








A0201 


Nonamer 


83 


CLNGGTCML 


27 


SEO ID NO-68 


A0201 


Nonamer 


5 


KMARFSYSV 


25 


SFO ID NO*31 


A0201 


Decamer 


16 


IMAISKVFEL 


28 


SEQ ID NO:32 


A0201 


Decamer 


13 


VIWIMAISKV 


26 


SEQ ID NO:33 


A26 


Nonamer 


35 


EFARPSRGY 


24 


SEQ ID NO:69 


A3 


Nonamer 


66 


RVPPMGIQH 


27 


SEQ ID NO:70 


A3 


Nonamer 


21 


KVFELGLVA 


24 


SEQIDNO:71 


BOS 


Nonamer 


17 


MAISKVFEL 


26 


SEQ ID NO:45 


B5101 


Nonamer 


6 


l^fARFSYSVI 


25 


SEQ ID NO:43 



B-3 Cripto 



3 specific class I epitopes 



HLA allele 


Epitope size 


Start 
position 


Sequence 


Score 


SEQ ID NO: 


A0201 


Nonamer 


83 


CLNGGTCML 


27 


SEQ ID NO:68 


A0201 


Nonamer 


176 


MLAGICLSI 


25 


SEQ ID NO:72 


A0201 


Decamer 


16 


IMAISKAFEL 


24 


SEQ ID NO:49 


A3 


Nonamer 


66 


RVLPMGIQH 


29 


SEQ ID NO:73 


B0702 


Nonamer 


60 


RPRSSQRVL 


24 


SEQ ID NO:60 


BOS 


Nonamer 


17 


MAISKAFEL 


25 


SEQ ID NO:61 
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C- Prediction of Class II epitopes using the Tepitope method. 



C-1 Cripto-1 and -3 Class II epitopes. 



HLA allele 


EoitoDe size 


iStart 
Dosition 


Seouence 


Score 


SFO ID NO' 


DRB1*0102 


Nonamer 


70 


MGIQHSKEL 


1.8 


SEO ID NO -74 


DRB 1*0301 


Nonamer 


152 


LVMDEHLVA 


5.9 


SFO TD NO-75 


DRB 1*0301 


Nonamer 


158 


LVASRTPEL 


4.3 


SEO ID NO-76 


DRB 1*0401 


Nonamer 


152 


LVMDEHLVA 


3.2 


SFO ID NO-75 

wJX_#\^ XJ-X i. ^ V-^ , / 


DRB 1 *0402 


Nonamer 


59 


TRPRSSORV 


4 9 


SFO TD NO- 16 


DRB 1 *0703 


Nonamer 


1 1 


Y^VTWTMAT 




SFO TD NO-77 


DRB 1*0703 


Nonamer 


158 


LVASRTPEL 

J—i V ifTLO JTV X X d J_l 


5 7 


SFO TD NO-76 


DRB 1*0802 


Nonampr 


59 




1 R 


SFO TD NO* 1 6 


DRB 1 *0802 


Nonamer 




V V U XT XvXXv^ O J— Iv-. 


2 


SFO TD NO-78 


DRB 1 *0804 


Nonamer 




TRPP <^<^nRV 


2 R 


SFO TD NO- 1 6 


DRB 1 *0806 


Nonampr 

X ^ v^i imiiV'i 


59 


TRPR <^c;nRV 


^ 1 


SFO TD NO- 1 6 


DRB1*1 101 


N^on a mp r 


1 1 

X X 


VCiVTWTMA T 

X O V X VV XluA J. 




SFO TD NO-77 


DRB1*1 101 

J-/XVXJ I X 1 W X 


^Jon a mpr 


159 


T A/MnPHT A/a 


7 7 


SFO TD NO-7S 


DRB 1*1 102 


Non a mpr 


1 57 


T A/MDPHT A/a 


7 4 


SFO TD NO-7S 

OXZiV^ JUL/ i^Vy. / «J 


DRB 1*1 104 

X^Xx.J^ X 11 V/^T 


Non a mp r 


25 


T.ni A/AnT,r:H 

j_iv:rxj v.rt.vji_i\jrT 


7 8 


SFO TD NO-70 

OXZ«v^ Xlv !> Vv. / V 


DRB1*1 104 


Nonampr 


1 52 


T A/ivrnPHT A/A 


7 


SFO TD NO-7S 


DRB1*1 106 

XVXVXJI X XX \I\J 


Non a mp r 


25 


T.ni A/Ar:T.nFT 


7 R 


SFO TT^ TSIO-7Q 


DRB 1*1 106 


^Jon a tnp r 


1 52 


T A/MrHTTTT A/A 






DRB1*1 107 


Nonampr 


46 


IT XxL^iVO JL V V i V' 


7 8 


SFO TD NO -80 


DRB 1*1 107 


Nonamer 


152 


LVMDEHLVA 


5.9 


SEO ID NO-75 


DRB 1*1107 


Nonamer 


158 


LVASRTPEL 


3.3 


SEQ ID NO:76 


DRB1*1305 


Nonamer 


11 


YSVIWIMAI 


3.7 


SEQ ID NO:77 


DRB 1*1307 


Nonamer 


11 


YSVIWIMAI 


1.2 


SEQ ID NO:77 


DRB 1*1501 


Nonamer 


138 


LRCFPQAFL 


4.3 


SEQ ID NO:20 


DRB 1*1501 


Nonamer 


152 


LVMDEHLVA 


4.5 


SEQ ID NO:75 


DRB 1*1502 


Nonamer 


152 


LVMDEHLVA 


3.5 


SEQ ID NO:75 


DRB5*0101 


Nonamer 


13 


VIWIMAISK 


5.3 


SEQ ID NO:67 
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C-2 Cripto-] 


I specific class 11 epitopes. 


HLA allele 


Epitope size 


Start 
position 


ScQuence 


Score 




DRB1*0101 


Nonamer 


1 c 

1 D 


TaTT"M'3\ T CVT7"C 


1 .D 


ceo TF^ XT/^.Q 1 


DRB1*0102 


iNonamer 


1 /O 


JXlij V vj ± L. J-i O X 


1 ft 


OCI^ LL/ INVJ.oZ 


DRB1*0102 


rNona.mer 


1 /o 


\ TT^ TOT C T C 


1 7 
1 . / 


m MO-ft'^ 

oriv^ ILI 1NL/.03 


DRB 1*0301 


iNOnanier 


J / 


lUT A T O'^T rC*CT 


'X Q 




DRB 1*0401 


rNonamer 


1 7ft 
1 /O 




7 ft 
Z.o 


m MO»ft1 


DRB 1 *0402 


Nonamer 


1 7Q 
1 /o 


VVjjXCXibXyb 


^ 7 


oiil^ JU INvJ.o3 


DRB 1 *0404 


Nonamer 


1 A 


XWiJyiAXbK.V 




oriV^ LU INlJ.o4 


DRB 1 *0404 


Nonamer 


ill 


LiVGXCLiSXQ 






DRB 1*0404 


Nonamer 


1 /o 


VGICLiSXQS 




oiiv^ JJU JN\J:o3 


DRB 1*0405 


Nonamer 


1 

1 /3 


r MLi VG X C Li S 




Al-' JNiJloO 


DRB 1*0405 


Nonamer 


1 

III 


T T T/^ T TOT 

LiVGXCIjSxQ 


1 1 




DRB 1*0405 


Nonamer 


1 TO 

1 /o 


VGXCLSIQS 


o o 
z.o 


ciro Tr\ xT/^.oi 
btl^ INU:o3 


DRB 1 *0703 


Nonamer 


ID 


WIMAI SKVF 


J. / 


bt,y ID XNO:ol 


DRB 1*0703 


Nonamer 


1 / 


TV/r TV T O T<''T T'C "O T 

MAI SKVFEIj 


/.O 


lU iN(J:4j 


DRB 1*0703 


Nonamer 


1 n/^ 
1 /O 


MIjVGICLSX 




orlC^ ID INUloZ 


DRB 1*0801 


Nonamer 


1 TC 
1 /j 


FMLVGxCIjS 


3.0 


oiiv2 11-^ NO:oo 


DRB 1 *0802 


Nonamer 


1 /j 


FMLVGICLS 


3.0 


bbQ ID N(j:oo 


DRB 1*0804 


Nonamer 


1 

1 /j 


FMLVGICLS 


Z.o 


11^ INUlOO 


DRB1*1101 


Nonamer 


1 T^ 


FML VG I C L S 




AD JNiJioO 


DRB 1*1101 


Nonamer 


1 TfQ 

1 /o 


VGICLSIQS 


LA 


oriy ID JNU:o3 


DRB1*1104 


Nonamer 




T7»*irT T T/^ T T O 

FMLVGICLS 


2.9 


^xiQ ID NO:oO 


DRB1*1104 


Nonamer 


ill 


T T T/~» T TOT 

LVGICLSIQ 


Z.O 


oiil^ ID iNOloj 


DRB1*1104 


Nonamer 


1 '7Q 

1 /o 


T T/^ T T O T /~\ O 

VGICLSIQS 


5 A 


briv2 Ad iNCJ.o3 


DRB 1*1 106 


Nonamer 


1 /D 


C*TUrT T T/^ T T O 

FML VG I C L S 


z.y 


euro Tr\ XT/^.QA 
oril^ ID JNwIoO 


DRB1*1106 


Nonamer 


1 77 

ill 


T TT^T/^T OT^ 

LVGICLSIQ 


7 

Z.O 


oi^y ID JNvJ.oj 


DRB1*1106 


Nonamer 


1 /o 


T T/^ T T O T O 

VGICLSIQS 


3.4 


otsy ID JND,o3 


DRB1*1 107 


Nonamer 


1 T 
1 / 


'K/T'A TOT^TT"C"CT 

MAX bKVr h<L 


Z.y 


oiiy ID INU.4J 


DRB 1*1 107 


Nonamer 


ill 


T T 7/~« T T O T 

LVGICLSIQ 


3.0 


oiilj ID JNvj.oj 


DRB1*1107 


Nonamer 


1 7Q 


T T/^ T f^T O T O 

VGXCLSXQo 




ceo ir^ XTO-QI 

oiiv2 AD JN(J.o3 


DRB 1*1302 


Nonamer 


ID 


T«TT1Ur7V T O^^ J"C 

WIMAI SKVF 


5,5 


ceo m XTO.Ql 

orii^ ID JND.ol 


DRB 1*1302 


Nonamer 


1 7^ 


FML VG I C L S 


3 


oiil^ ID JNiJ.iSO 


DRB 1*1305 


IN onamer 


1 c 


WXMAXoJxVr 


3.1 


ceo m xio-fti 


DRB 1 * 1 305 


Nonamer 


1 7^ 

1 ID 


rMLVGXCLo 


4.3 


ceo \W XTO.QA 
olil^ ID iNVJ.oO 


DRB 1*1307 


rSKjlialllCT 


1 7S 


r iXLJLi VVjrXV^ J-iO 




ceo rn xto-ra 


DRB 1*1307 


Nonamer 


177 


LVGICLSIQ 


1.6 


SEQ ID NO:85 


DRB1*1501 


Nonamer 


6 


MARFSYSVI 


4.5 


SEQ ID NO:43 


DRB1*1501 


Nonamer 


176 


MLVGICLSI 


4.1 


SEQ ID NO:82 


DRB 1*1502 


Nonamer 


6 


MARFSYSVI 


3.5 


SEQ ID NO:43 


DRB5*0101 


Nonamer 


15 


WIMAI SKVF 


4.1 


SEQ ID NO: 81 
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C-2 Cripto-3 specific class II epitopes. 







Start 


lL7C^ UCl ICC 


ocorc 




DRRl *0101 




15 




1 ^ 


OJ-fV^ J-L' 1NV-/.0 / 


DRB1*0102 




6 




1 4 


SFO TD 

01_<V^ JX-' iNvy.oo 


DRB1*0102 




17 


MAT ciKAFFT. 


1 6 
1 ,\j 


SFO TD Nn*61 


DRB 1*0401 


^Jnn r 


7 


V X\.£^ O J. O V J. VV 


9 Q 


CPO TD Nn'51 


DRR 1*0401 


^Joti 5) mi* r 


175 


FMT.AfiTPT.C! 

1*1 X w J-l O 


9 7 




DRB 1 *0402 


Nnn Ji mp r 

1 ^ Vdfl lul I Iwl 


67 


V J_J 1V3 J. V^Xl O 


^ 6 


^FO TD NO QO 


DRB 1 *0404 


l^lJIlu.lllC>i 






9 5 


<\Fn TD 

OdV^ J-L/ InV-^.OO 


DRB 1 *0404 




14 


TTaTTMAT cjxf A 

X vv J.l\LrlX Olvn 


6 


9FO TD MO-Ol 


DRB 1 *0404 




67 




^ 5 


^FO TD NO-QO 


DRR 1 *040S 


^Jrkti fmnpf* 

l^UIIullICl 


175 


f ITl J_Lr\v3 X V_ J-i O 


9 7 




DRRi *n7n"^ 


i> viliaiilCi 


7 


\7T? F CVQ\7TTa7 
V Xxir O X O V X W 


fk S 

D.J 


^FO TD MO'^l 


DRBl *070'^ 


i^iJllCtlllCl 


1 <^ 
1 J 




5 7 


<sFO TD Mn-87 
omv^ ii-' iNv-/.o / 


DRRl *070'^ 


iNuiiciiiicr 


1 7 
1 / 


MA T QTTZiFFT 

rxH.x oJxn.r JZfij 


7 S 


oCV^ ll-' INw.Oi 


DRRl *0S01 


i\ uiid-iiicr 


1 A 






QFO rr> MO-Q9 


DRRl *0R01 


i^uiiaiiicr 


17S 


FMT zvnTr^T C 


J.D 


^FO TFi 1MO-8Q 


DR R 1 *08n9 


1^ oiiaiiicr 


o / 




1 Q 


QFO m Km -on 


DRRl *0R09 


i>uiiciiiicr 


17S 


r rlLiJ\Kj X V-^ Jj o 


J>. J 




DRR 1 *n804 


INUIlalllCI 




\7"T 'Divrr' TOTJQ 
V ±j IriYLVj X ViJ. O 


9 Q 


QFO TT^ KTO-On 


DRR 1 *ORn4 


IN UllalllCI 


1 7S 


r lYl X J_i o 


9 ^ 


Tr> TsJO-RQ 

t>mv^ iL/ iNw.o" 


DRR 1 *0R06 


IN wllctlllCl 


1 u 


Xi\L/-\.X OIx/\r -tj 


/I 


^FO TFJ KIO-Q9 


DR R 1 * 1 1 0 1 


INLIIlalllcr 


1 7S 


FMT ar'Tr'T c 






DR R 1 * 1 1 09 

l-/I\.Jj 1 11 


iNOilalllCr 


o / 


V i-i trriKj X 


1 

J. I 


QFO Tr> MO-on 


DR B 1 * 1 1 09 


l>\JllcllllCl 


175 


FMT ACTCT Q 


9 S 

A.J 


<2FO TD MH-RQ 
oII2»\^ 11-/ i>*v>/.o^ 


DRBl *1 107 


l^Ulldllld 


7 

1 


V X\Xr O X O V X vv 


9 R 


^FO TD Mn-<N 1 


DRR 1*1*^01 


IXJIlalllCI 




V Xi ±rl*lVj X V-O. O 


5 
J.J 


QFO TD Tsin-on 

oCV^ 11-/ INv-/,3/u 


DR R 1 * 1 '^09 


IXJIIalllcr 




TaTTMA T QTf a F 




QFO Tn 'NrO-R7 


DRBl * 1 '^09 


1 >i ijl 1 al 1 iCl 


1 75 
1 / J 


FMT Ar'Tr'T Q 


Q 


QFO TT^ MO-RQ 


DRB1*1^0S 


iNVJllalllcl 


1 J 


TaTTMA T Q'K' a F 

vV X I^lrt. X O xVrt. J7 


1 

J . 1 


<\Fn TD MO R7 

OlJ#V^ lU/ INV-/.0 / 


DRB 1*1305 


Nonampr 

X ^ Wl lUI 1 IWl 


175 


XT 1 X J_I^^VJ X J_l k_> 




SFO TD 


DRB1*1307 


Nonamer 


67 


VLPMGIQHS 


1.5 


SEQ ID NO:90 


DRB 1*1307 


Nonamer 


175 


FMLAGICLS 


3.5 


SEQ ID NO:89 


DRB1*1501 


Nonamer 


6 


MVRFSYSVI 


6.6 


SEQ ID NO:88 


DRB 1*1502 


Nonamer 


6 


MVRFSYSVI 


5.6 


SEQ ID NO:88 


DRB5*0101 


Nonamer 


15 


WIMAISKAF 


4.1 


SEQ ID NO: 87 



Example 5 : anti-tumor potential of the humoral response induced by Cripto 
5 vaccines 

A series of experiments aimed at assessing the inhibitory effect of the Cripto-specific 
humoral response on the pathophysiological activities of Cripto in cancer could be 
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carried out by using various standards in vitro assays. The in vitro assays could be, but 
not restricted to, growth inhibition assay, cell motility inhibition assay, chemotaxis 
inhibition assay, inhibition of the invasion through extracellular matrix protein (ECM), 
and growth signal transduction pathway inhibition. The effects of the sera of 
5 inmiunized animals with Cripto peptides or protein in adjuvant, or with a plasmid DNA 
or a viral delivery system, e.g. adenoviral vector, encoding the Cripto protein could be 
assessed in these in vitro assays that gauge Cripto-mediated biological effects on 
Cripto-expressing cell lines. In parallel, the effect of the pre-immune sera from the 
same animals will be tested as negative control. The cell lines used in these assays 

10 could be, but not limited to, human tumor cells expressing Cripto such as GEO cells, 
NTERA2 cells, the CRL-5815 cell line, or murine tumor cell lines that naturally over- 
express Cripto. As example, the inhibition of the in vitro growth of both GEO and 
NTERA2 cells has previously been demonstrated by treatment with anti-sense 
oligonucleotides designed to prevent the translation of Cripto mRNA (Baldassarre, et 

15 al, Int J Cancer 66: 538-543 (1996); Ciardiello, et aL Oncogene 9:291-298 (1996); 
Alpe, et al Int J Cancer 88: 566-574 (2000)) 

5.1 : Immunization protocol : 

20 Mice or rabbits would be immunized on day 0, 14 , and 21 by intra- footpad injections 
of either peptides or protein in adjuvant, intra-dermal injections of a plasmid DNA 
encoding Cripto using gene-gun devices, or intra-dermeil injections of a viral vector 
delivery system, e.g. adenoviral vector, encoding Cripto. 

25 5.2 : Cell proliferation inhibition assay : 

The sera from immunized animals will be collected and added at different dilutions to 
the culture medium of cells platted in 96-well plates. Similarly, pre-immune sera of 
these animals will also be added to cells as negative control. The cells will be treated 
30 for 3 to 7days. The cell growth will be measured by standard methods such as ^H- 
. thymidine incorporation assay, MTT assay, crystal violet staining, or colony counting 
for proliferation in soft agar-medium. 

5.3 : Cell Invasion, motility, and chemotaxis inhibition assays : 

35 

The sera from inmiunized animals will be collected pre- and post-immunization and 
added at different dilutions to cell suspensions used for invasion, motility, and 
chemotaxis standards assays. The inhibition of Cripto-mediated invasion, motility, and 
chemotaxis could be assessed with the use of, for instance, but limited to, commercially 
40 available Falcon chambers with matrigel inserts (Collaborative Research), agarose 
droplet motility assay (Yamamoto et aL Ophthalmology 97: 1204-1210 (1994)), and 
commercially available Boyden apparatus (Neuro Probe), respectively. 

5.4 : Signal transduction inhibition assays: 

45 

The sera from immunized animals will be collected pre- and post-immunization and 
added at different dilutions to Cripto-expressing tumor cells in culture. Exogenous 
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Cripto protein or human sera or milk containing the highest concentration of Cripto 
detected by ELISA (Bianco, et ai. Breast Cancer Res Treat 66:1-7 (2001)) will be 
added to the culture medium of the cells. After various incubation times, the cells will 
be harvested and processed by standard methods in order to perform immunoblot 
5 analysis. The phosphorylation status of various key signal transduction pathways 
involved in cell proliferation will be assessed in these Cripto-simulated cells in the 
presence of pre- or post-immune sera. For instance, the tyrosine phosphorylation status 
of erb B-4 and mitogen-activated protein (MAP) kinase family such as ERKl, ERK-2, 
and P38 will be assessed by immunoblotting with antibodies that recognize the 
10 phosphorylated, active form of these enzymes as previous described (Bianco, et al. J 
Biol Chem 274: 8624-8629 (1999); Paine, et al J Biol Chem 275: 11284-11290 
(2000)). 

Example 6; in vivo anti-tumor effect of the immune response induced by Cripto 
IS vaccines in animal models 

The prophylactic or therapeutic potential of vaccines containing the human Cripto 
protein, Cripto peptides, or Cripto gene can be evaluated in mice challenged with 
sjoigeneic murine tumor cell line that express Cripto. The tumor cell lines could be a 

20 murine tumor cell line transfected with the human Cripto gene. For instance the 

transfected cell lines could be the TCI cell line transfected with human Cripto gene. On 
the other hand, the high level of homology between human Cripto and its murine 
homologue suggests that cross-reactive inmiune responses induced by the human 
vaccine can protect against mouse Cripto-expressing tumors. Indeed, Cripto gene 

25 (TDGF-1) encodes a 171 -amino acid protein which has 93% identity with its human 
counterpart (Liguori, et al, Mamm Genome 7:344-348 (1996)). Therefore, mice could 
be protected by immunization with a human Critpo vaccine from murine tumor 
challenge or spontaneously arising tumor known to endogenously over-express the 
Cripto protein. With this respect, spontaneous tumors in transgenic mice designed to 

30 over-express several different oncogenes such as MMTV-Polyoma virus middle T 

antigen, MMTV-c-ErbB2, and MT-hTGF alpha, have been shown also to over-express 
Cripto. (Kenney, et al Mol Carcinog 15: 44-56 (1996); Niemeye, et al Int J Cancer 
81:588-591 (1999)). 

35 Alternatively, in order to vaccinate with the syngeneic gene, the tumor-bearing mice 
could be vaccinated with a plasmid DNA or a viral delivery system, e.g. adenoviral 
vector, encoding the murine Cripto protein to protect from murine tumor growth. 

6.1 : Prophylactic experimental design: 

40 

Mice would be vaccinated on day 0 and 14, prior tumor challenge, by either intra - 
footpad injections of 5|lg of Cripto protein in adjuvant, intra-dermal injections of DNA 
plasmid encoding Cripto using gene-gun technology, or intra-dermal injections of a 
viral vector, e.g. adenoviral vector, encoding Cripto. Then, 10^ TCl-CR-1 cells, human 
45 Cripto-expressing tumor cells, could be injected subcutaneously in the flank of 
C57BLy6 immunocompetent mice 1 or 2 weeks post vaccination. The tumor growth 
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should be monitored in vivo by measuring individual tumors twice a week for several 
weeks post-tumor challenge. 

On the other hand, the efficacy of prophylactic vaccination could be assessed by 
5 inhibiting the development of spontaneous Cripto-expressing tumors in transgenic mice 
(Niemeye, et aL Int J Cancer 81:588-591 (1999)) immunized multiple times before 
onset of the palpable tumor with either form of Cripto vaccines. 

6.2 : Therapeutic experimental design : 

10 

10^ TCl-CR-1 cells, human-Critpo expressing tumor cells, would be injected 
subcutaneously in the flank of C57BL/6 immunocompetent mice. Mice could be 
vaccinated on days 7 and 14, post-tumor challenge, by either intra -footpad injections 
of 5|Xg Cripto protein in adjuvant, intra-dermal injections of DNA plasmid encoding 

15 Cripto using gene-gun technology, or intra-dermal injections of a viral vector, e.g. 
adenoviral vector, encoding Cripto. The tumor growth should be monitored in vivo by 
measuring individual tumors twice a week. One to 4 weeks after the second 
immunization, several mice per group will be sacrificed to harvest spleen cells, draining 
lymph nodes, and sera for analysis of the immune responses to establish a correlation 

20 between the induction of a Cripto-specific immune response and the anti-tumor effect. 
The analysis of the Cripto-specific immune response induced by immunization could 
be assessed by measuring the antibody titers, antibody isotypic profile, the CD4^ T-cell 
proliferation response, and CTL CD8"^ T-cell responses including cytokines production 
and lysis activity against Cripto-expressing target cells. All assays would be performed 

25 according to standards protocols. 

A similar kind of experiment could be carried out by challenging parental mice with 
transplantable murine mammary tumor lines over-expressing Cripto. These cells would 
be established from spontaneous tumor arising in various oncogene transgenic mouse 
30 strains such as MMTV-Polyoma virus middle T antigen, MMTV-c-ErbB2, and MT- 
hTGF alpha transgenic mice (Niemeye, et al. Int J Cancer 81:588-591 (1999)). 

Example 7; Demonstration of Cripto-1 immunogenicitv and recognition of Cripto 
protein 

35 

7.1 Objectives 

This experiment demonstrates that a selected polypeptide fragment of Cripto (SEQ ID 
NO: 97) mixed with a vaccine adjuvant induces Cripto peptide-specific antibody 
40 response in immunized animals. Furthermore, the assessment of immunological 
recognition of Cripto protein shows that antibodies recognize the tumour-associated 
antigen itself. Finally, an immunostaining study on cancer tissues using anti-Cripto 
antibodies shows Cripto-specific immune response to target cripto naturally expressed 
in cancer cells. 

45 
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7.2 Selection of Cripto-1 derived peptides 

A 17mer synthetic peptide (CPPSFYGRNCEHDVRKE - SEQ ID NO:97) that 
corresponds to amino acids 97 to 113 of the human Cripto-1 sequence was prepared 
5 from the amino acid sequence of human Cripto-1. This peptide was coupled to KLH 
using gluteraldehyde. 

7.3 Immunization protocol of rabbits 

10 Two rabbits were each immunized with KLH coupled peptide. Antibodies were 
produced in rabbits during 3 months according to a protocol described in section 3.1 of 
the specification. Antisera were tested by ELISA against the peptide and the carrier 
protein. 

15 7.4 Test of rabbits sera on extracts of recombinant full-length Cripto-1 expressed 
in E. coli 

E. coli recombinant protein Cripto-1 (18 Kd) was detected with the sera of rabbits 
injected with synthetic peptide aa97->113 (SEQ ID NO:97). Western blots using 

20 monoclonal antibodies to Cripto-1 and rabbit antisera from rabbits injected with SEQ 
ID NO:97 indicated the presence of a single intense polypeptide band at approximately 
18 kD in cell extract as well as cell pellets. In addition, Coommassie Blue stained 
polyacrylamide gel indicated the presence of a polypeptide (-18 kD). No polypeptide 
having a molecular weight of 18kD was detected by antibodies, antisera, or 

25 Coommassie Blue stain in the negative control or cellular extract supernatant. 

Example 8: Demonstration of antibody recognition of Cripto in tumour tissue by 
immunostaining studies using anti-Cripto polyclonal antibodies 

30 SCID mice subcutcineous xenograft tumors (mRNA cripto-expressing CRL5815 cell 
line - human small cell lung carcinoma - expression determined by real time PCR 
methods) were incubated in buffered formalin fixative for 24h at room temperature and 
paraffin embedded. Five micron thick sections were attached to superfrost glass slides 
and deparaffinized. Prior to antibody binding, the sections were heated in glyca buffer 

35 (Biogenex) (microwave for 10 min), then saturated with PBS buffer containing 0.5% 
blocking reagent (Boerhinger) for 30 min. Two rabbit anti-human CR-1 peptides (32- 
44; 54-71) were separately added (dil. 1/50 - 1/200) and incubated at 4'^C overnight. 
The sections were rinsed three times in PBS, incubated with mouse-anti-rabbit IgG 
biotinylated secondary antibody (Sigma) at 1:500 in PBS/ 0.5% blocking reagent for 30 

40 min at room temp. The sections were then rinsed three times in PBS, treated with 
H2O2 1% to quench endogenous peroxidases, incubated in HRP-Streptavidin (Zymed) 
at 1 : 500 in PBS and, finally, after PBS rinsing, HRP activity was revealed using DAB 
(Zymed) solution for 5 min. The reaction was stopped with extensive rinsing in 
deionised water. The sections were counterstained for 2 min. in hematoxylin, brought 

45 to xylene and mounted with DPX solution. 
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8.1 Results 

Cripto-l peptide was detected using rabbit anti-human CR-1 peptide 54*71 (dil. 1/50) 
5 (SEQ ED NO: 12) by immunostaining. The immunostaining specificity was confirmed 
by preabsorption of the anti-CR-1 peptide (54-71) antibody with the corresponding 
purified Cripto-l peptide (50 jig/ml). 

Cripto peptides formulated in vaccine adjuvant are inmiunogenic in animals. 
10 Antibodies raised in immunized rabbits may recognize not only a peptide used as an 
inmiunogen, but also a recombinant Cripto protein expressed in E.coli and Cripto 
naturally expressed in tumour tissue. Cripto is a suitable target for active 
immunotherapy. 

15 Example 9; Demonstration of the immunogenicitv of recombinant Cripto-l 
protein 

9.1* Immunogenicity of recombinant Cripto-l protein mixed v^ith adjuvant 

20 Recombinant Cripto protein purified from E.Coli was used to inmiunize rabbits as 
described in Example 3. 

9.2. ELISA protocol to monitor of an antibody response 

25 Quantification of anti-Cripto antibodies in rabbit sera was performed by ELISA using 
Cripto protein as coating antigen and expressed as the titer at midpoint dilution. 

For example, antigen and antibodies solutions were used at 100 fil/well. Antigen was 
diluted to a final concentration of 1 pg/ml in PBS and was absorbed overnight at 4°C to 

30 the wells of 96 wells microtiter plates. The plates were then incubated for 1 hour at 
37°C with PBS containing 1% bovine serum albumin and 0.1% tween 20 (saturation 
buffer). Two-fold dilutions of sera (starting 1/100) in the saturation buffer were added 
to the Cripto coated plates and incubated for 1 hour 30 minutes at 37°C. The plates 
were washed four times with PBS 0.1% Tween 20 and biotin-conjugated anti-rabbit Ig 

35 diluted 1/5000 in saturation buffer were added to each well. The plates were incubated 
for 1 hour 30 minutes at 37°C. After washing step, streptavidin biotinylated 
peroxydase complex reagent diluted 1/5000 in saturation buffer was added. Plates were 
kept for an additional 30 min at 37°C. 

40 The plates were washed as above and incubated for 15 minutes with TMB. The reaction 
was stopped with 0.2 M H2SO4 and the optical densities were measured using a 
microplate reader connected to a computer. Data were captured with SoftMaxPro 
software and analyzed. 
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9.3 Results: 

No anti-Cripto antibodies were detected pre-vaccination as shown in Table 2. At day 

21 after the second vaccinations all the three animals showed anti-Cripto eintibodies, 
5 and 21 days after 4 vaccinations antibody titers were increased as shown in Table 2. 



Table 2 





Mid Point Dilution 




Pre- 
vaccination 


21 days 
Post 1 


21 days 
Post 2 


21 days 
Posts 


21 days 
Post 4 


rabbit 777 


232 


50 


2686 


2858 


1127 


rabbit 778 


58 


67 


3074 


4848 


7645 


rabbit 779 


132 


119 


8548 


8315 


11523 



This experiment shows that repeated administration of purified Cripto adjuvanted with 
10 QS21, 3D-MPL and oil-in-water emulsion induces specific antibodies. This experiment 
also demonstrates that recombinant Cripto antigen delivered with a vaccine adjuvant is 
immunogenic. 

Example 10; Immunogenicitv of recombinant Cripto-1 gene bv genetic DNA 
IS immunisation 

This experiment demonstrates that recombinant Cripto when delivered as a nucleic acid 
sequence (genetic immunization) encoding Cripto protein induces a Cripto-specific 
inmiune response in mice. 

20 

10.1. Genetic construct design for in vivo expression & mouse immunization - 

Genetic construct, in vivo expression and mouse immunization were performed as 
described in Example 3. 

25 

10.2. Determination of serum antibodies by Western blot on crude extracts of 
E.coli recombinant Cripto-1 protein 

Serum from vaccinated mice was taken at day 0 and 42, for Westem blot analyses for 
30 the presence of anti-cripto antibodies. Proteins contained in the solubilized pellet of an 
acellular extract, were separated by SDS-PAGE and used to test the sera of mice by 
Westem blot. The pre-immune and post-immune semm (42 days post-immunisation) 
of eight mice, and a pool of the negative control group sera were tested at the dilution 
1/500. 

35 

The sera of all inmiunized mice reacted with mature Cripto-1 recombinant protein 
present in the extract as demonstrated by Westem blot analysis. 

This experiment shows that repeated administration of cripto DNA sequence can 
40 induce Cripto specific antibodies. Therefore, recombinant Cripto protein delivered as 
the genetic vaccine is immunogenic in mice. 
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All publications and references, including but not limited to patents and patent 
applications, cited in this specification are herein incorporated by reference in their 
entirety as if each individual publication or reference were specifically and individually 
5 indicated to be incorporated by reference herein as being fully set forth. Any patent 
application to which this application claims priority is also incorporated by reference 
herein in its entirety in the manner described above for publications and references. 



-89- 



VU60111 



Sequence Listing 
SEQ ID NO:l 

5 GGAGAATCCCCGGAAAGGCTGAGTCTCCAGCTCAAGGTCAAAACGTCCAAGGCCGAAA 
GCCCTCCAGTTTCCCCTGGACGCCTTGCTCCTGCTTCTGCTACGACCTTCTGGGGAAA 
ACGAATTTCTCATTTTCTTCTTAAATTGCCATTTTCGCTTTAGGAGATGAATGTTTTC 
CTTTGGCTGTTTTGGCAATGACTCTGAATTAAAGCGATGCTAACGCCTCTTTTCCCCC 
TAATTGTTAAAAGCTATGGACTGCAGGAAGATGGCCCGCTTCTCTTACAGTGTGATTT 

10 GGATCATGGCCATTTCTAAAGTCTTTGAACTGGGATTAGTTGCCGGGCTGGGCCATCA 
GGAATTTGCTCGTCCATCTCGGGGATACCTGGCCTTCAGAGATGACAGCATTTGGCCC 
CAGGAGGAGCCTGCAATTCGGCCTCGGTCTTCCCAGCGTGTGCCGCCCATGGGGATAC 
AGCACAGTAAGGAGCTAAACAGAACCTGCTGCCTGAATGGGGGAACCTGCATGCTGGG 
GTCCTTTTGTGCCTGCCCTCCCTCCTTCTACGGACGGAACTGTGAGCACGATGTGCGC 

15 AAAGAGAACTGTGGGTCTGTGCCCCATGACACCTGGCTGCCCAAGAAGTGTTCCCTGT 
GTAAATGCTGGCACGGTCAGCTCCGCTGCTTTCCTCAGGCATTTCTACCCGGCTGTGA 
TGGCCTTGTGATGGATGAGCACCTCGTGGCTTCCAGGACTCCAGAACTACCACCGTCT 
GCACGTACTACCACTTTTATGCTAGTTGGCATCTGCCTTTCTATACAAAGCTACTATT 
AATCGACATTGACCTATTTCCAGAAATACAATTTTAGATATCATGCAAATTTCATGAC 

20 CAGTAAAGGCTGCTGCTACAATGTCCTAACTGAAAGATGATCATTTGTAGTTGCCTTA 
AAATAATGAATACAATTTCCAAAATGGTCTCTAACATTTCCTTACAGAACTACTTCTT 
ACTTCTTTGCCCTGCCCTCTCCCAAAAAACTACTTCTTTTTTCAAAAGAAAGTCAGCC 
ATATCTCCATTGTGCCTAAGTCCAGTGTTTCTTTTTTTTTTTTTTTTTGAGACGGAGT 
CTCACTCTGTCACCCAGGCTGGACTGCAATGACGCGATCTTGGTTCACTGCAACCTCC 

25 GCATCCGGGGTTCAAGCCATTCTCCTGCCTAAGCCTCCCAAGTAACTGGGATTACAGG 
CATGTGTCACCATGCCCAGCTAATTTTTTTGTATTTTAGTAGAGATGGGGGTTTCACC 
ATATTGGCCAGTCTGGTCTCGAACTCTGACCTTGTGATCCATCGATCAGCCTCTCGAG 
TGCTGAGATTACACACGTGAGCAACTGTGCAAGGCCTGGTGTTTCTTGATACATGTAA 
TTCTACCAAGGTCTTCTTAATATGTTCTTTTAAATGATTGAATTATATGTTCAGATTA 

30 TTGGAGACTAATTCTAATGTGGACCTTAGAATACAGTTTTGAGTAGAGTTGATCAAAA 
TCAATTAAAATAGTCTCTTTAAAAGGAAAGAAAACATCTTTAAGGGGAGGAACCAGAG 
TGCTGAAGGAATGGAAGTCCATCTGCGTGTGTGCAGGGAGACTGGGTAGGAAAGAGGA 
AGCAAATAGAAGAGAGAGGTTGAAAAACAAAATGGGTTACTTGATTGGTGATTAGGTG 
GTGGTAGAGAAGCAAGTAAAAAGGCTAAATGGAAGGGCAAGTTTCCATCATCTATAGA 

35 AAGCTATATAAGACAAGAACTCCCCTTTTTTTCCCAAAGGCATTATAAAAAGAATGAA 
GCCTCCTTAGAAAAAAAATTATACCTCAATGTCCCCAACAAGATTGCTTAATAAATTG 
TGTTTCCTCCAAGCTATTCAATTCTTTTAACTGTTGTAGAAGACAAAATGTTCACAAT 
ATATTTAGTTGTAAACCAAGTGATCAAACTACATATTGTAAAGCCCATTTTTAAAATA 
CATTGTATATATGTGTATGCACAGTAAAAATGGAAACTATATTGACCTAAAAAAAAAA 

40 AAA 
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SEQ ID NO:2 

AAGCTTGCGCGCCATGTAAGGTAAAGTGACTGATTCTATAGCAATCCAATTGTTCCTT 
TGTCTGCCCGTTTACATATAACAATGTTGTCAATGTTTGATTGAAAATACCTAGCAGG 
5 CGACACACACACACCTAGCTCCTCAGGCGGAGAGCACCCCTTTCTTGGCCACCCGGGT 
ATCCCCCAGGGAGTACGGGGCTCAAAACACCCTTTTGGAGAACAAGGTGGAAGCAAAT 
TTCAGGAAGTAAAACTTCCTGAAATAAAATAAAATATCGAATGCCTTGAGACCCATAC 
ATTTTCAGGTTTTCCTAATTAAAGCAATTACTTTCCACCACCCCTCCAACCTGGAATC 
ACCAACTTGGTTAGAGAAACTGATTTTTCTTTTTTCTTTTTTTTTCCCAAAAGAGTAC 

10 ATCTGATCATTTTAGCCTGCAACTAATGATAGAGATATTAGGGCTAGTTAACCACAGT 
TTTACAAGACTCCTCTCCCGCGTGTGGGCCATTGTCATGCTGTCGGTCCCGCCCACCT 
GAAAGGTCTCCCCGCCCCGACTGGGGTTTGTTGTTGAAGAAGGAGAATCCCCGGAAAG 
GCTGAGTCTCCAGCTCAAGGTCAAAACGTCCAAGGCCGAAAGCCCTCCAGTTTCCCCT 
GGACACCTTGCTCCTGCTTCTGCTACGACCTTCTGGGAACGCGAATTTCTCATTTTCT 

15 TCTTAAATTGCCATTTTCGCTTTAGGAGATGAATGTTTTCCTTTGGCTGTTTTGGCAA 
TGACTCTGAATTAAAGCGATGCTAACGCCTCTTTTCCCCCTAATTGTTAAAAGCTATG 
GACTGCAGGAAGATGGTCCGCTTCTCTTACAGTGTGATTTGGATCATGGCCATTTCTA 
AAGCCTTTGAACTGGGATTAGTTGCCGGGCTGGGCCATCAGGAATTTGCTCGTCCATC 
TCGGGGAGACCTGGCCTTCAGAGATGACAGCATTTGGCCCCAGGAGGAGCCTGCAATT 

20 CGGCCTCGGTCTTCCCAGCGTGTGCTGCCCATGGGAATACAGCACAGTAAGGAGCTAA 
ACAGAACCTGCTGCCTGAATGGGGGAACCTGCATGCTGGAGTCCTTTTGTGCCTGCCC 
TCCCTCCTTCTACGGACGGAACTGTGAGCACGATGTGCGCAAAGAGAACTGTGGGTCT 
GTGCCCCATGACACCTGGCTGCCCAAGAAGTGTTCCCTGTGTAAATGCTGGCACGGTC 
AGCTCCGCTGCTTTCCTCAGGCATTTCTACCCGGCTGTGATGGCCTTGTGATGGATGA 

25 GCACCTCGTGGCTTCCAGGACTCCAGAACTACCACCGTCTGCACGTACTACCACTTTT 
ATGCTAGCTGGCATCTGCCTTTCTATACAAAGCTACTATTAATCGACATTGACCTATT 
TCCAGAAATACAATTTTAGATATTATGCAAATTTCATGACCCGTAAAGGCTGCTGCTA 
. CAATGTCCTAACTGAAAGATGATCATTTGTAGTTGCCTTAAAATAATGAATACAATTT 
CCAAAACGGTCTCTAACATTTCCTTACAGAACTAACTACTTCTTACCTCTTTGCCCTG 

30 CCCTCTCCCAAAAAACTACTTCTTTTTTCAAAAGAAAGTCAGCCATATCTCCATTGTG 
CCCAAGTCCAGTGTTTCTTTTTTTTTTTTGAGACGGAGTCTCACTCTGTCACCCAGGC 
TGGACTGCAATGACGCGATCTCGGTTCACTGCAACCTCCGCATCCGGGGTTCAAGCCA 
TTCTCCTGCCTCAGCCTCCCAAGTAGCTGGGATTACAGGCATGTGTCACCATGCCGGC 
TAATTTTTTTGTATTTTAGTAGAGACGGGGGTTTCACCATATTGGCCAGCTGGTCTCG 

35 AACTCTGACCTTGTGATCCATCGCTCGCCTCTCGAGTGCTGAGATTACACACGTGAGC 
AACTGTGCAAGGCCTGGTGTTTCTTGATACATGTAATTCTACCAAGGTCTTCTTAATA 
TGTTCTTTTAAATGATTGAATTATACACTCAGATTATTGGAGACTAAGTCTAATGTGG 
ACCTTAGAATACAGTTTTGAGTAGAGTTGATCAAAATCAATTAAAATAGTCTCTTTAA 
AAGGAAAGAAAACATCTTTAAGGGGAGGAACCAGAGTGCTGAAGGAATGGAAGTCCAT 

40 CTGCGTGTGTGCAGGGAGACTGGGTAGGAAAGAGGAAGCAAATAGAAGAGAGAGGTTG 
AAAAACAAAATGGGTTACTTGATTGGTGATTAGGTGGTGGTAGAGAAGCAAGTAAAAA 
GGCTAAATGGAAGGGCAAGTTTCCATCATCTATAGAAAGCTATGTAAGACAAGGACTC 
CCCTTTTTTTCCCAAAGGCATTGTAAAAAGAATGAAGTCTCCTTAGAAAAAAAATTAT 
ACCTCAATGTCCCCAACAAGATTGCTTAATAAATTGTGTTTCCTCCAAGCTATTCAAT 
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TCTTTTAACTGTTGTAGAAGAGAAAATGTTCACAATATATTTAGTTGTAAACCAAGTG 
ATCAAACTACATATTGTAAAGCCCATTTTTAAAATACATTGTATATATGTGTATGCAC 
AGTAAAAATGGAAACTATATTGACCTAAAAAAAAAAAAAGGAAACCACCCTTAGGCAG 
GCAGGACATGCTCTTCAGAACTCTGCTCTTCAGAGTTCCAAAGAAGGGATAAAACATC 
5 TTTTATNNCCATCAAATAGC 



SEQ ID NO:3 

MDCRKMARFSYSVIWIMAISKVFELGLVAGLGHQEFARPSRGYLAFRDDSIWPQEEPA 
10 IRPRSSQRVPPMGIQHSKELNRTCCLNGGTCMLGSFCACPPSFYGRNCEHDVRKENCG 
SVPHDTWLPKKCSLCKCWHGQLRCFPQAFLPGCDGLVMDEHLVASRTPELPPSARTTT 
FMLVGICLSIQSYY 

SEQ ID NO:4 

15 

MDCRKMVRFSYSVIWIMAI SKAFELGLVAGLGHQEFARPSRGDLAFRDDS IWPQEEPA 

IRPRSSQRVLPMGIQHSKELNRTCCLNGGTCMLESFCACPPSFYGRNCEHDVRKENCG 
SVPHDTWLPKKCSLCKCWHGQLRCFPQAFLPGCDGLVMDEHLVASRTPELPPSARTTT 
FMLAGICLSIQSYY 

20 

SEQ ID NO:5 

GGAGAATCCCCGGAAAGGCTGAGTCTCCAGCTCAAGGTCAAAACGTCCAAGGCCGAAA 
GCCCTCCAGTTTCCCCTGGACGCCTTGCTCCTGCTTCTGCTACGACCTTCTGGGGAAA 

25 ACGAATTTCTCATTTTCTTCTTAAATTGCCATTTTCGCTTTAGGAGATGAATGTTTTC 
CTTTGGCTGTTTTGGCAATGACTCTGAATTAAAGCGATGCTAACGCCTCTTTTCCCCC 
TAATTGTTAAAAGCTATGGACTGCAGGAAGATGGCCCGCTTCTCTTACAGTGTGATTT 
GGATCATGGCCATTTCTAAAGTCTTTGAACTGGGATTAGTTGCCGGGCTGGGCCATCA 
GGAATTTGCTCGTCCATCTCGGGGATACCTGGCCTTCAGAGATGACAGCATTTGGCCC 

30 CAGGAGGAGCCTGCAATTCGGCCTCGGTCTTCCCAGCGTGTGCCGCCCATGGGGATAC 
AGCACAGTAAGGAGCTAAACAGAACCTGCTGCCTGAATGGGGGAACCTGCATGCTGGG 
GTCCTTTTGTGCCTGCCCTCCCTCCTTCTACGGACGGAACTGTGAGCACGATGTGCGC 
AAAGAGAACTGTGGGTCTGTGCCCCATGACACCTGGCTGCCCAAGAAGTGTTCCCTGT 
GTAAATGCTGGCACGGTCAGCTCCGCTGCTTTCCTCAGGCATTTCTACCCGGCTGTGA 

35 TGGCCTTGTGATGGATGAGCACCTCGTGGCTTCCAGGACTCCAGAACTAGCACCGTCT 
GCACGTACTACCACTTTTATGCTAGTTGGCATCTGCCTTTCTATACAAAGCTACTATT 
AATCGACATTGACCTATTTCCAGAAATACAATTTTAGATATCATGCAAATTTCATGAC 
CAGTAAAGGCTGCTGCTACAATGTCCTAACTGAAAGATGATCATTTGTAGTTGCCTTA 
AAATAATGAATACAATTTCCAAAATGGTCTCTAACATTTCCTTACAGAACTACTTCTT 

40 ACTTCTTTGCCCTGCCCTCTCCCAAAAAACTACTTCTTTTTTCAAAAGAAAGTCAGCC 
ATATCTCCATTGTGCCTAAGTCCAGTGTTTCTTTTTTTTTTTTTTTTTGAGACGGAGT 
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CTCACTCTGTCACCCAGGCTGGACTGCAATGACGCGATCTTGGTTCACTGCAACCTCC 
GCATCCGGGGTTCAAGCCATTCTCCTGCCTAAGCCTCCCAAGTAACTGGGATTACAGG 
CATGTGTCACCATGCCCAGGTAATTTTTTTGTATTTTAGTAGAGATGGGGGTTTCACC 
ATATTGGCCAGTCTGGTCTCGAACTCTGACCTTGTGATCCATCGATCAGCCTCTCGAG 
5 TGCTGAGATTACACACGTGAGCAACTGTGCAAGGCCTGGTGTTTCTTGATACATGTAA 
TTCTACCAAGGTCTTCTTAATATGTTCTTTTAAATGATTGAATTATATGTTCAGATTA 
TTGGAGACTAATTCTAATGTGGACCTTAGAATACAGTTTTGAGTAGAGTTGATCAAAA 
TCAATTAAAATAGTCTCTTTAAAAGGAAAGAAAACATCTTTAAGGGGAGGAACCAGAG 
TGCTGAAGGAATGGAAGTCCATCTGCGTGTGTGCAGGGAGACTGGGTAGGAAAGAGGA 

1 0 AGO AAAT AG AAG AG AGAGGTTG AAAAAC AAAATGGGTT AC TTG ATTGGTGATT AGGTG 
GTGGTAGAGAAGCAAGTAAAAAGGCTAAATGGAAGGGCAAGTTTCCATCATCTATAGA 
AAGCTATATAAGACAAGAACTCCCCTTTTTTTCCCAAAGGCATTATAAAAAGAATGAA 
GCCTCCTTAGAAAAAAAATTATACCTCAATGTCCCCAACAAGATTGCTTAATAAATTG 
TGTTTCCTCCAAGCTATTCAATTCTTTTAACTGTTGTAGAAGACAAAATGTTCACAAT 

15 ATATTTAGTTGTAAACCAAGTGATCAAACTACATATTGTAAAGCCCATTTTTAAAATA 
CATTGTATATATGTGTATGCACAGTAAAAATGGAAACTATATTGACCTAAAAAAAAAA 
AAA 



SEQ ID NO:6 

20 

MDCRKMARFSYSVIWIMAISKVFELGLVAGLGHQEFARPSRGYLAFRDDSIWPQEEPA 

IRPRSSQRVPPMGIQHSKELNRTCCLNGGTCMLGSFCACPPSFYGRNCEHDVRKENCG 
SVPHDTWLPKKCSLCKCWHGQLRCFPQAFLPGCDGLVMDEHLVASRTPELPPSARTTT 
FMLVGICLSIQSYY 

25 

SEQ ID NO:7 

TGGGTAGGAAAGAGGAAGCAAAT 



30 SEQ ID NO:8 

TGCTTCTCTACCACCACCTAATCA 



35 



SEQ ID NO:9 

CGTCCAAGGCCGAAAGCCCTCCAGTT 



SEQ ID NO:10 
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TTGGGAGAGGGCAGGGCAAAGAAGTAAGAA 

SEQ ID NO:ll 

5 GHQEFARPSRGYL 

SEQIDNO:12 

QEEPAIRPRSSQRVPPMG 



10 



SVIWIMAISK 



20 

HQEFARPSR 



25 IRPRSSQRV 



IQHSKELNR 

30 



GRNCEHDVR 



SEQ ID NO: 13 



15 SEQ ID NO: 14 

SVPHDTWLPK 



SEQ ID NO: 15 



SEQIDNO:16 



SEQ ID NO: 17 



SEQ ID NO:18 
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SEQ ID NO:19 

VRKENCGSV 



5 SEQ ID NO:20 

LRCFPQAFL 



SEQ ID NO:21 

10 

SRTPELPPS 



SEQ ID NO:22 

15 QRVPPMGIQH 



SEQ ID NO:23 

NRTCCLNGGT 

20 

SEQ ID NO:24 

GQLRCFPQAF 



25 SEQ ID NO:25 

QAFLPGCDGL 



SEQ ID NO:26 

30 

DGLVMDEHLV 



SEQ ID NO:27 

35 FELGLVAGL 

-95- 



VU60111 



SEQ ID NO:28 

KELNRTCCL 



5 SEQ ID NO:29 

QLRCFPQAF 



SEQ ID NO:30 

10 

SARTTTFML 



SEQ ID NO:31 

15 KMARFSYSV 



SEQ ID NO:32 

IMAISKVFEL 

20 

SEQ ID NO:33 

VIWIMAISKV 



25 SEQ ID NO;34 

FMLVGICLSI 



SEQ ID NO:35 

30 

ARFSYSVIW 



SEQ ID NO:36 

35 CRKMARFSY 
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SEQ ID NO:37 

ARFSYSVIWI 



5 SEQ ID NO:38 

ARPSRGYLAF 



SEQ ID NO:39 

10 

ARTTTFMLV 



SEQ ID NO:40 

15 ARPSRGYLA 



SEQ ID NO:41 

CRKMARFSYS 

20 

SEQ ID NO:42 

QEFARPSRGY 



25 SEQ ID NO:43 

MARFSYSVI 



SEQ ID NO:44 

30 

SARTTTFMLV 



SEQ ID NO:45 

35 MAISKVFEL 
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SEQ ID NO:46 

FARPSRGYL 



5 SEQIDNO:47 

KMVRFSYSV 



SEQ ID NO:48 

10 

CMLESFCAC 



SEQ ID NO:49 

15 IMAISKAFEL 



SEQ ID NO:50 

FMLAGICLSI 

20 

SEQ ID NO:51 

VRFSYSVIW 



25 SEQ ID NO:52 

CRKMVRFSY 



SEQ ID NO:53 

30 

VRFSYSVIWI 



SEQ ID NO:54 

35 ARPSRGDLAF 
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SEQ ID NO:55 

ARPSRGDLA 



5 SEQ ID NO:56 

ARTTTFMLA 



SEQ ID NO:57 

10 

IRPRSSQRVL 



SEQ ID NO:58 

15 CRKMVRFSYS 



SEQ ID NO:59 

QRVLPMGIQH 

20 

SEQ ID NO:60 

RPRSSQRVL 



25 , SEQ ID NO:61 

MAISKAFEL 



SEQ ID NO:62 

30 

FARPSRGDL 



SEQ ID NO:63 

35 GICLSIQSY 
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SEQ ID NO:64 

LVAGLGHQEF 

5 SEQ ID NO:65 

DTWLPKKCSL 



SEQ ID NO:66 

10 

AIRPRSSQR 

SEQ ID NO:67 

15 VIWIMAISK 



SEQ ID NO:68 

CLNGGTCML 

20 

SEQ ID NO:69 

EFARPSRGY 

25 SEQ ID NO:70 

RVPPMGIQH 



SEQ ID NO:71 

30 

KVFELGLVA 



SEQ ID NO:72 

35 MLAGICLSI 
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SEQ ID NO:73 

RVLPMGIQH 



5 SEQ ID NO:74 

MGIQHSKEL 



SEQ ID NO:75 

10 

LVMDEHLVA 



SEQ ID NO:76 

15 LVASRTPEL 



SEQ ID NO:77 

YSVIWIMAI 

20 

SEQ ID NO:78 

WLPKKCSLC 



25 SEQ ID NO;79 

LGLVAGLGH 



SEQ ID NO:80 

30 

FRDDSIWPQ 



SEQ ID NO:81 

35 WIMAISKVF 
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SEQ ID NO:82 

MLVGICLSI 



5 SEQ ID NO:83 

VGICLSIQS 



SEQ ID NO:84 

10 

IWIMAISKV 



SEQ ID NO:85 

15 LVGICLSIQ 



SEQ ID NO:86 

FMLVGICLS 

20 

SEQ ID NO:87 

WIMAISKAF 



25 SEQ ID NO;88 

MVRFSYSVI 



SEQ ID NO:89 

30 

FMLAGICLS 



SEQ ID NO:90 

35 VLPMGIQHS 
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SEQ ID NO:91 

IWIMAISKA 



5 SEQ ID NO:92 

IMAISKAFE 



SEQ ID NO:93 

10 

GRNCEHDVRK 



SEQ ID NO:94 

15 SRTPELPPSA 



SEQ ID NO:95 

CGTCCAAGGCCGAAAGCCCTCCAGTTTCCCCTGGACGCCTTGCTCCTGCTTCTGCTAC 

20 GACCTTCTGGGGAAAACGAATTTCTCATTTTCTTCTTAAATTGCCATTTTCGCTTTAG 
GAGATGAATGTTTTCCTTTGGCTGTTTTGGCAATGACTCTGAATTAAAGCGATGCTAA 
CGCCTCTTTTCCCCCTAATTGTTAAAAGCTATGGACTGCAGGAAGATGGCCCGCTTCT 
CTTACAGTGTGATTTGGATCATGGCCATTTCTAAAGcCTTTGAACTGGGATTAGTTGC 
CGGGCTGGGCCATCAGGAATTTGCTCGTCCATCTCGGGGATACCTGGCCTTCAGAGAT 

25 GACAGCATTTGGCCCCAGGAGGAGCCTGCAATTCGGCCTCGGTCTTCCCAGCGTGTGC 
CGCCCATGGGGATACAGCACAGTAAGGAGCTAAACAGAACCTGCTGCCTGAATGGGGG 
AACCTGCATGCTGGGGTCCTTTTGTGCCTGCCCTCCCTCCTTCTACGGACGGAACTGT 
GAGCACGATGTGCGCAAAGAGAACTGTGGGTCTGTGCCCCATGACACCTGGCTGCCCA 
AGAAGTGTTCCCTGTGTAAATGCTGGCACGGTCAGCTCCGCTGCTTTCCTCAGGCATT 

30 TCTACCCGGCTGTGATGGCCTTGTGATGGATGAGCACCTCGTGGCTTCCAGGACTCCA 
GAACTACCACCGTCTGCACGTACTACCACTTTTATGCTAGTTGGCATCTGCCTTTCTA 
TACAAAGCTACTATTAATCGACATTGACCTATTTCCAGAAATACAATTTTAGATATCA 
TGCAAATTTCATGACCAGTAAAGGCTGCTGCTACAATGTCCTAACTGAAAGATGATCA 
TTTGTAGTTGCCTTAAAATAATGAATACAATTTCCAAAATGGTCTCTAACATTTCCTT 

35 ACAGAACTACTTCTTACTTCTTTGCCCTGCCCTCTCCCAA 

SEQ ID NO:96 

MDCRKMARFSYSVIWIJyEAISKAFELGLVAGLGHQEFARPSRGYLAFRDDSIWPQEEPA 
IRPRSSQRVPPMGIQHSKELNRTCCLNGGTCMLGSFCACPPSFYGRNCEHDVRKENCG 
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SVPHDTWLPKKCSLCKCWHGQLRCFPQAFLPGCDGLVMDEHLVASRTPELPPSARTTT 
FMLVGICLSIQSYY 



SEQ ID NO: 97 

5 CPPSFYGRNCEHDVRKE 
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